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1. Introduction 

In this survey we discuss some interleaved strands of ideas connecting the items in the title. 
We do not, of course, develop all the connections between groups and automata. In particular, 
we do not consider either automatic groups (see, for instance, the monograph }3~J ] by Epstein, 
Cannon, Hold, Levy, Paterson, and Thurston) or automata groups, also called self-similar groups 



(including the well known Grigorchuk group of intermediate growth [371 . |44| : see, for instance, 
[39l [fl Q and the monograph 84 1 by Nekrashevych) . 



A finitely generated group can be described by a presentation G = (X; R) in terms of generators 
and defining relators. In this case, the group alphabet is S = X U X~ l . Anisimov Q introduced 
the fruitful point of view of considering the Word Problem of G = (X;R) as the formal language 
WP(G : X; R) — {w 6 X* : w — Iq}- Although the Word Problem is generally a very complicated 
set, Anisimov asked what one could say about the group G if WP(G : X; R) is a regular or context- 
free language in the usual sense of formal language theory. He showed that a finitely generated 
group has regular Word Problem if and only if the group is finite. An important class of groups 
is the class of virtually free groups, that is, groups having a free subgroup of finite index. Muller 
and Schupp [79[ showed that a finitely generated group has context-free Word Problem if and only 
if the group is virtually free. 

The basic geometric object associated with a finitely generated group G — (X;R), its Cayley 
graph r(G : X]R), was already defined by Cayley [l3| in 1878. Intuitively, an end (a notion due 
to Hopf [49( and Frcudenthal [32j) of a locally finite graph is a way to go to infinity in the graph. 
The number of ends of a connected graph T with origin vq is the limit, as n goes to infinity, of the 
number of infinite connected components of T \ T n , where the n-ball r„ consists of all vertices and 
edges on paths of length less than or equal to n starting at vq . The number of ends of a finitely 
generated group is the number of ends of its Cayley graph. (It is not obvious, but true, that this 
number depends only on the group and not on the particular presentation chosen.) The proof of 
the characterization of groups with context-free Word Problem depends heavily on the Stallings 
structure theorem [96| , which shows that finitely generated groups with more than one end must 
have a particular algebraic structure. 
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It turns out that the connection between ends and context-freeness is much deeper than just 
the case of groups. It is well-known 22, 4(| 5(| that a formal language is context-free if and only 



if it is the language accepted by some pushdown automaton. The concept of a finitely generated 
graph gives a common framework in which one can discuss both Caylcy graphs of finitely generated 
groups and complete transition graphs of various kinds of automata, in particular the complete 
transition graph of a pushdown automaton. 

Instead of considering the number of ends of a finitely generated graph T, one can consider 
the number c(T) of labelled graph isomorphism classes of connected components of T \ T n over 
all components and all n > 1. Say that T has ftnitary end-structure if c(T) < oo. Muller and 
Schupp [80j proved that a finitely generated graph has finitary end-structure if and only if T is 
isomorphic to the complete transition graph T(M) of some pushdown automaton M. 

One of the most powerful positive results about decision problems in logic is Rabin's theo- 
rem [8^| that the second-order monadic theory of the rooted infinite binary tree Ti is decidable. 
This theory, S2S, is the theory of two successor functions, as we now explain. We consider the 
infinite binary tree as the rooted tree with root vq and right successor edges labelled by 1 and 
left successor edges labelled by 0. The second-order monadic logic of Ti has variables ranging 
over arbitrary sets of vertices. We have two set-valued successor functions: if S is a set of vertices 
and a £ {0,1} then Sa = {va : v £ S}. There is also the relation symbol C for set inclusion 
and a constant symbol vq for the origin. There are the usual quantifiers V, 3 and the Boolean 
connectives A (and), V (or), and -i (negation). Some formulations include individual variables for 
single vertices, but sets with a single element are definable, as is equality. The great power of this 
language is that one can quantify over arbitrary sets of vertices. 

The characterization of graphs with finitary end structure shows that such graphs are "very 
treelike" . Indeed, such a graph T contains a regular subtree of finite index, in the sense that there 
is a subtree T defined by a finite automaton and a fixed bound D > such that every vertex in 
r is within distance D of some vertex in the subtree T. From this fact, it is possible to reduce 
questions about the monadic theory of T to questions about the monadic theory of the tree T. 
It then follows from Rabin's theorem that the monadic theory of the complete transition graph 
of any pushdown automaton is decidable. In particular, if G = (X;R) is any finitely generated 
presentation of a virtually free finitely generated group then the monadic second-order theory of 
its Cayley graph T(G : X; R) is decidable. There are finitely generated graphs which do not have 
finitary end structure but whose monadic theories are decidable. However, Kuske and Lohrey jpoj 
have recently proved that if the monadic theory of the Cayley graph of a finitely generated group 
is decidable then the group must be virtually free. 

There is an interesting application of the decidability of the monadic second-order theory of 
Cayley graphs of context-free groups to the theory of cellular automata on groups. The following 
definition is actually a straightforward generalization of von Neumann's concept (86j of cellular 
automata on the grid on integer lattice points in the plane, that is, the Cayley graph of Z 2 . Let 
G be a group and S a finite set and denote by E G the set of all maps a: G — > E. Equip E G with 
the action of G defined by 

g(a)(h) = a(.g _1 /i) for all a £ E G and g,h £ G. 

Then one says that a map C : E G — > E G is a cellular automaton provided there exists a finite 
subset M C G and a map [i : E M — > E such that 

C(a)( 5 )= A1 ((. 9 - 1 a)| A/ ) (1.1) 

for all a £ E G and g £ G, and where (-)|m denotes the restriction to M. One is often interested in 
determining whether or not a cellular automaton is surjective (respectively, injective, bijective). 
In particular, the following decision problem naturally arises: given a finite subset M C G and 
a map /x: E M — > E, is the associated cellular automaton C: E G — > E G defined in (jl.ll) surjective 
(respectively injective, bijective) or not? Amoroso and Patt [l| proved in 1972 that if G = Z the 
above problem is decidable. If follows from the decidability of the monadic second-order theory of 
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Cayley graphs of context-free groups that the problem for cellular automata defined over virtually- 
free groups is decidable. On the other hand, Kari [H^, IH^. IH3 | proved that if G = Z d , d > 2, this 
problem is undccidable. His proof is based on Berger's [7[ undecidability result for the Domino 
Problem for Wang tiles. 

In 1960 Biichi proved that the monadic theory of N with one successor function, SIS, 
is decidable by introducing finite automata working on infinite words. Monadic sentences are 
too complicated to deal with directly and the idea is to effectively associate with each monadic 
sentence </> a finite automaton such that <j> is true if and only if the language L{A r f ) ) accepted by 
A(j> is nonempty. Of course, one must carefully define what it means for an automaton to accept an 
infinite word. Rabin used automata working on infinite trees to establish a similar correspondence 
between sentences of S2S and the Emptiness Problem for tree automata. 

The theory of automata working on infinite inputs is thus crucial to studying monadic theories, 
but proving theorems about such automata is difficult. The best way to understand such automata 
is in terms of infinite games of perfect information as introduced by Gale and Stewart [34j . Let 
S be a finite alphabet and let S N denote the set of all infinite words w = a\a-i ■ ■ ■ a n ■ ■ ■ over 
E (all the infinite words which we consider are infinite to the right). Let W be a subset of S N . 
We consider the following game between Player I and Player II: Player I chooses a letter ci £ £ 
and Player II then chooses a letter <i2 G S. Continuing indefinitely, at step n Player I chooses a 
letter a^n-x £ £ and Player II then chooses a letter U2n £ £• This sequence of choices defines 
an infinite word w € £ N . Player I wins the game if w £ W and Player II wins otherwise. The 
basic question about such games is whether or not one of the players has a winning strategy, that 
is, a map <j> : E* — > £ such that when a finite word u has already been played, the player using 
the strategy then plays 4>(u) £ S and always wins. Using the Axiom of Choice, it is possible to 
construct winning sets such that neither player has a winning strategy, but this cannot happen if 



the set W is not "too complicated" . An important theorem of Martin [70J, [71| shows that if the 
set W is a Borel set then one of the two players must have a winning strategy. 

To apply infinite games to automata, given an automaton M one defines the acceptance game 
G(M, w) for M on an infinite input w £ E N . The first player wins if M accepts w while the second 
player wins if M rejects. In the case of automata, the winning condition of the acceptance game 
is at the second level of the Borel hierarchy so one of the players has a winning strategy. This 
essentially proves closure of under complementation of regular languages in E N . The situation 
is similar for automata on the binary tree. The celebrated "Forgetful Determinacy Theorem" of 
Gurevich and Harrington 42| states that a fixed finite amount of memory, the later appearance 
record, is all that a winning strategy needs to take into account. 

The paper is organized as follows. In Section [2] we review the notions of regular, context-free, 
and computably enumerable languages together with the parallel notions of grammars and their 
associated classes of automata: finite-state automata, pushdown automata, and Turing machines. 
Section [3] is devoted to presentations of finitely generated groups and their associated Cayley 
graphs. We consider the Word Problem for a finitely generated group as a formal language. We 
prove Anisimov's characterization of groups with regular Word Problem and present the Mullcr- 
Schupp characterization of groups with context-free Word Problem. We also discuss some applica- 
tions of formal language theory to subgroups and present Haring-Smith's characterization of basic 
groups in terms of their Word Problem. In Section @] we consider the notion of a finitely generated 
graph and the number of ends of a finitely generated graphs together with Stallings Structure 
Theorem and the notion of accessibility. We then consider the notion of finitely generated graphs 
with finitary end-structure and their characterization as complete transition graphs of pushdown 
automata. Section [5] is devoted to second-order monadic logic where we discuss Biichi's theorem 
on the decidability of second-order monadic theory SIS and Rabin's theorem on the decidability of 
second-order monadic theory, S2S, of the infinite binary tree. We then discuss the decidability of 
second-order monadic theory for complete transition graphs of pushdown automata. We consider 
the classical Domino Problem and its undecidability due to Bcrger and Robinson. After general- 
izing the Domino Problem to finitely generated groups, we show that it is decidable for virtually 
free groups. In Section [5] we consider the Surjectivity, Injectivity, and Bijectivity problems for 
cellular automata on finitely generated groups and its decidability for virtually free groups. The 
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last section is devoted to finite automata on infinite inputs and the work of Biichi, of Rabin, and of 
Mullcr and Schupp. We then discuss infinite games of perfect information, the theorems of Davis 
and Martin, and the Forgetful Dctcrminacy theorem of Gurevich and Harrington. 



2. Languages, Grammars, and Automata 

2.1. The free monoid over a finite alphabet 

Let E be a finite alphabet, that is, a finite set of letters. A word on E is any element of the set 



E* = |J E", 



where E n = {0102 ■ • • a n : E S, 1 < k < n}. The number \w\ = n is the length of the word 
w = a\02 ■ • ■ a n . The unique word of length zero is denoted by e and is called the empty word. 

The concatenation of two words w = a\02 ■ • • a n E E n and w' = a\a! 2 ■ ■ ■ a' m E E m is the word 
ww 1 E S n+m defined by 

WW = a\02 ■ ■ ■ a n a'ia 2 ■ ■ ■ a' m . (2-1) 

We have sw = we = w and (ww')w" = w{w'w") for all w, w', w" E E*. Thus, E* is a monoid under 
the concatenation product with identity element the empty word e. The monoid E* satisfies the 
following universal mapping property: if M is any monoid, then every map / : E — Y M uniquely 
extends to a monoid homomorphism (p: S* — > M. Due to this property, E* is the free monoid 
over E. 

Let u, w be two words over E. One says that u is a subword of w if there exist Ui,U2 E E* 
such that w = U\UU2- 

A language over E is a subset L c E*. 

Context-free languages 



In this section, we discuss the class of context-free languages introduced by Chomsky [24j . 

A context-free grammar is a quadruple 5 = (V, E, P, 5*0), where is a finite set of variables, 
disjoint from the finite alphabet E of terminal symbols. The variable So E V is the start symbol, 
and P C V x (V U E)* is a finite set of production rules. We write S 1 h u if (S 1 , it) E P. For 
v, w S (V U E)*, we write v u> if w = v\Sv2 and w; = V1UV2, where u,v\,V2 E (V U E)* and 
Shu. The expression ij ==>■ if is a single derivation step, and it is called rightmost if «2 E E*. 
A derivation is a sequence u = wq, w\, . . . , w n = w E (V U E)* such that Wi =>■ Wj+i for each 
i = 0, . . . , n — 1 and we then write v ==>■ w. A rightmost derivation is one where each step is 
rightmost. It can be easily shown that if v =>■ w with ui E E*, then there exists a rightmost 
derivation v =>- w. For 5 £ K, we consider the language L5 = {w E E* : S w}. The language 
generated by Q is 

:=L So = {weZ* :S ^w}. 
A context-free language is a language generated by a context-free grammar. 

Example 2.1 (Dyck's language). The language of all correctly balanced expressions involving 
several types of parentheses is in some sense the "primordial" context-free language. Let n > 1 
and E = {ai, a\, . . . ,a n , a n }. Consider the grammar Q with one single variable 5*0 and productions 
So h e and So h aiSociiSo, i = 1, . . . , n. The language L({?) generated by the grammar Q is called 
the Dyck language. Thinking of the a^'s (resp. a^'s) as n different "open" (resp. "closed") 
parenthesis symbols, then L{Q) consists of all correctly nested parenthesis expressions over these 
symbols. For example, 

So I - a 2So&2So = r h a2Soa,2 a iSo(iiSo =^ ^So^fliSoai 

=> a2Soa2aia2Soa,2Soai => a 2 5oa2aia2S'oa2ai a 2 S'oa2aia2a2ai 
a 2 a2ai02a2ai 

is the unique rightmost derivation of 020201020201 E L(Q). 
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A context-free grammar Q = (V, E, P, So) and its associated language L{Q) are called linear if 
every production rule in P is of the form S h v\Tvi or S h d, where v, i>i,t>2 S E* and S 1 ,! £ F. 
If in this situation one always has vi = e (the empty word), then the grammar and language are 
called right linear. Similarl y, t he grammar and language are left linear if one always has v\ = e. 
It is well known (cf. [22, 4f| 5(J) that both left linear and right linear grammars generate the same 



class of languages, namely, the class of regular languages. 

Example 2.2 (Palindromes). Let E be a finite alphabet. A word w = a\ai ■ ■ ■ a n is a palindrome 
provided that cij = a„_.; + i for all i = 1,2, ... ,n, that is, w is the same read both forwards and 
backwards. We denote by L pa i(E) the language consisting of all palindromes over the alphabet E. 
For example, L pa i({a}) = {a}* = {e, a, aa, aaa, . . .} and 

L pa \{{a, b}) = {e, a, 6, aa, bb, aaa, aba, bab, bbb, aaaa, abba, baab, bbbb, . . .}. 

Consider the grammar Q with a unique variable So and productions of the form So \~ e, So h a 
and So I - aSoa, for each a G E. Then Q is a linear grammar and L(Q) = L pa i(E). It follows that 
the language consisting of all palindromes is linear. 

Example 2.3 (The free group). Let A be a finite set and denote by Fx the free group based on 
A. (If n denotes the cardinality of X we shall also denote Fx by F n and refer to it as to the free 
group of rank n.) Let X -1 be a disjoint copy of X and set E = IU A -1 . We denote by x n> x^ 1 
the involutive map on E exchanging X and A" 1 so (a;™ 1 ) -1 = x for all x G A. A word w € S* is 
reduced if it contains no subword of the form xx~ 1 or x~ 1 x for x G A. For example, if x,y G A 
are distinct, then the words e, a;, xy, xy~ x , xy~ 1 x~ 1 arc reduced, while xx~ 1 ,x~ 1 xy are not. We 
denote by L rc d(E) C E* the language consisting of all reduced words. It is well known that every 
element of Fx has a unique representative as a reduced word in L re( j(S). 

Consider the grammar Q = (V, E, P, So) where V = {So} Li {S x : x £ T,} and P consists of the 
productions of the form 

So h e and So \~ xS x for all x G E 



and 



S x h e and S x h yS y for all y G E \ {.t x } 



for all x G E. Note that <7 is a right-linear grammar and that L(Q) = L re d(E). Thus, the language 
of all reduced words over E is regular. 

Returning to a general context-free grammar Q, for a given variable S G V, we define the degree 
of ambiguity, ds{w), of a word w G E* as the number of different rightmost derivations S =>■ u>. 
We have ds(w) > if and only if w G £5. The grammar is called unambiguous if ds {w) = 1 
for all w G L(Q). Otherwise, if there exists w G L{Q) such that ds {w) > 1, the grammar is 
called ambiguous. A context-free language L is called unambiguous if it is generated by some 
unambiguous grammar and inherently ambiguous if all context-free grammars generating L are 
ambiguous. It is a fact that there exist inherently ambiguous context-free languages (cf. [50(). 

2.3. Growth of context-free languages 

Let E be a finite alphabet and L C E* a language. 

The growth function of L is the map 7^ : N — >■ N defined by 

1l{ti) = \{w G L : \w\ < n}\, n G N. 

Note that 

™ \y\n+l _ I 

7i(n) < 7s* («) = E I s !" = ivi _ 1 ^ ^ = 
fe=0 ' ' 

for all n G N where C > | E | . It follows that there exist C > and a > 1 such that 

7i (n)<Ca™ (2.2) 



G 



for all neN. 

The growth rate of L is the number 



X(L) = limsup \{w e L : |w| < n}|™ . (2.3) 

n— >oo 

On says that L is of exponential growth if A(L) > 1. Otherwise, if X(L) = 1, then L is of sub- 
exponential growth. Note that L is of exponential growth if and only if there exists a > 1 such that 
lL,{n) > for all n 6 N. A language L is said to be of polynomial growth provided that there 
exist an integer d > and a constant C > such that (n) < C + Cn d for all n GN. Finally, one 
says that L is of intermediate growth if its growth is sub-exponential but not polynomial. Note 
that a language cannot be of "super-exponential growth" by virtue of (|2.2j) . 

Bridson and Oilman fiol ] and, independently, Incitti [HH ] , proved that the growth of a context- 
free language is either polynomial or exponential. An explicit algorithm for determining this 



alternative is presented in [14|. On the other hand, Grigorchuk and Machi [38[ presented an ex- 
ample of an indexed language of intermediate growth. (The class of indexed languages, introduced 
by A. Aho, properly contains the class of context-free languages and, in turn, is properly contained 
in the class of computably enumerable languages.) 
One says that the language L is growth- sensitive if 

\{L F ) < X(L) 

for every non-empty FcE* consisting of subwords of elements of L, where 

L F = {w E L : no v G F is a subword of w}. 

It is a well known fact, which can be deduced from the Perron- Frobenius theory (see [l8| for an 
alternative proof), that regular languages are growth-sensitive. Ceccherini-Silbcrstcin and Woess 
[l9L [TBI ] (see also 0) extended this result to all unambiguous ergodic context-free languages. 
(Here "erg odicity" corresponds to strong connectedness of the dependency graph (in the sense of 
Kuich [59|) associated with an unambiguous context-free grammar generating the language.) 



2.4- Finite automata 

A nondeterministic finite automaton is a 5-tuple A = (Q,Ti,S,qo, F) where Q is a nonempty 
finite set of states, E is a finite alphabet, qo € Q is the initial state, F C Q is the set of final states, 
and the map 

5: Q xE^P(Q) 

is the transition function. (As usual, V{Q) denotes the set of all subsets of Q.) The automaton 
works as follows. When reading a word w € E*, letter by letter, from left to right, it can change 
its state according to the transition function. A run of A on a word w = <J\o~<z ■ ■ • o~ n is a function 
p : {0, 1, . . . , n + 1} — > Q such that p(0) = qo and p(i + 1) € 5(p{i), di) for i = 0, 1, . . . , n. A word 
w = oio~2 • • • o~n € E* is accepted by A if there exists a run p of A on w such that p{n + 1) S F. 
In short, A accepts w if there is a sequence of choices allowed by the transition function such that 
A is in a final state after reading the word w. The set of all words iiieE' accepted by A is called 
the language accepted by A and it is denoted by L(A). 

The automaton A is said to be deterministic if \S(q, a)\ < 1 for all q £ Q and a € E, where | • | 
denotes cardinality. 

The following is a fundamental characterization of regular languages (see, e.g. (22I l46l [HojV 

Theorem 2.4. Let E 6e a finite alphabet and L C E* &e a language. Then L is regular (that is, it 
is generated by a left-linear (equivalently, by a right-linear) grammar) if and only if it is accepted 
by a deterministic finite automaton. 

Example 2.5 (The free group). Let A be a finite set with E = X U A -1 and the map x n> x^ 1 
as in Example 12.31 Let A = (Q,T,,S,qo,F) be the finite state automaton with state set Q = 
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{<?()} U {q x ■ x G E}, F = Q (all states are terminal), and where the transition function is defined 
by 

S(q ,x) = q x 

q y if y ^ x^ 1 



${qx,y) 



otherwise. 



for all x, y G E. It is immediate to see that the language accepted by the automaton A consists of 
all reduced words over the alphabet E, that is, L(A) = £ rc d(E). 

Graphically, one represents a finite automaton A = (Q,T,,5,qo,F) as a labelled graph. (See 
Section EH] for more on labelled graphs). The vertex set is Q and, for every p G Q and a G E, 
there is an oriented edge from p to q, with label a, for all q G 5(p, a). The initial state is 
denoted by an ingoing arrow into it and a double circle is drawn around each final state. In 
Figure [1] we represented the automaton A recognizing the language L ro d(E) of reduced words on 




2.5. Pushdown automata 

A pushdown automaton is a 7-tuplc M. = (Q, E, Z, S,qo, F, zq), where Q is a nonempty finite 
set of states, E is a finite alphabet, called the input alphabet, Z is a finite set of stack symbols, 
go € Q is the initial state, F C Q is the set of final states, and zq G Z U {e} is the start symbol. 
Finally, the transition function is a map 

5: Q x (EU{e}) x (Z U {e}) P fin (Q x Z*) 

where Vfin(Q x Z*) stands for the set of all finite subsets of Q x Z* . 

The automaton is represented in Figure [5] and works in the following way. The automaton 
reads a word w G E* from the input tape, letter by letter, from left to right. At any time, it is in 
some state q G Q, and the stack contains a word Q G Z* . If the current letter of w is a, the state 
is q and the top symbol of the stack word £ is z, then it performs one of the following transitions: 

(i) M. can move to the next position on the input tape. If the letter read is a, M. selects some 
(q',C) € <5(g, a, 2;) , changes to state q' , and replaces the rightmost symbol z of £ by If 
there are no more letters on the input tape the machine halts. 

or, without advancing the tape, 

(ii) M. can select some (<?', £') G S(q, s, z), changes to state q' , remain at the current position on 
the input tape and replace the rightmost symbol z of ( by (' . Note that M can make several 
successive moves of this type without advancing the tape. Transitions of this type are called 
e-transitions. 
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Input tape 



W 



Finite control 




state: q 





Stack 



Figure 2: Representation of a pushdown automata. The input tape contains the word w and its current letter is a. 
The stack contains the word £ starting by the letter z. 

If both 8{q, a, z) and S(q, e, z) are empty then M halts. 

Note that, in general, a pushdown automaton is nondeterministic in the sense that it has more 
than one choice of a possible transition. A pushdown automaton M. is deterministic if for any 
5 £ Q, a 6 E and z £ Z U {e}, it has at most one option of what to do next, that is, 

\S(q,a,z)\ + \S(q,e,z)\ < 1. 

Since we are interested in groups, our convention is that the automaton is allowed to continue 
to work when the stack is empty, i.e., when £ = e. Then the automaton acts in the same way 
as before, by changing to state q' and putting £' in the stack if it advances the tape and selects 
(<z',C') G (5(g, a, e) in case (i), or by making an e-transition (V, C') € S(q,e,e) in case (ii). This 
convention is different from that of many authors, for example [50(, who require the automaton 
to halt on an empty stack. 

Let w £ E*, q £ Q, and £ £ Z*. We write hA h (q,() if, starting at the initial state go and 

w 

with only zq in the stack, it is possible for the automaton M (after finitely many transitions) to 
be in state q with £ written on the stack, after reading the input w. If q £ F and £ = e we say 
that M. accepts w. The language accepted by M. is then defined by 

L(M) := {w £ S* : M h (q,e) for some q £ F}. 



Example 2.6. Every finite automaton A may be viewed as a pushdown automaton. Indeed, if 
A = (Q, S, 5, goj F), consider the pushdown automaton A4 = (Q, E, Z, 5', qo, F, e), where Z = 
and the transition function 5' : Q x (E U {e}) x {e} — > Va n (Q x {e}) is defined by setting 

5'(q,a,e) = {(q',e) : q' £ 5(q,a)} 

for all q £ Q and a £ E. It is clear that L(A) = L(A4). Note that M. is deterministic whenever 
A is deterministic. 

The following is a fundamental characterization of context-free languages (see (22I Eil. [Ho| ) . 

Theorem 2.7 (Chomsky). Let E be a finite alphabet and L C E* be a language. Then L is 
context-free (that is, it is generated by a context-free grammar) if and only if it is accepted by a 
pushdown automaton. Moreover, L is unambiguous if and only if it is accepted by a deterministic 
pushdown automaton. 

Note that since there exist inherently ambiguous context-free languages (which therefore are not 
accepted by any deterministic pushdown automaton), it follows that nondeterministic pushdown 
automata are strictly more powerful than deterministic ones. 

Example 2.8 (The Dyck language revisited). Let n > 1 and E = {a±, 01, 02, 0,2, a n ,a n }. 
Consider the deterministic pushdown automaton A4 = (Q, E, Z, 5, q$, F, zq) with Q = {qo} = F, 
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Z = S, z = e and S: {q } x (S U {e}) x (S U {e}) -> ^({go} x £*) defined by setting 



<$(<?o,a, z) 



{(qo,e)} ]£a = z 
{(qo 7 za)} otherwise 



for all a, z G £ U {e}. (We use the convention that e = e.) Then it is easy to check that L(Ai) is 
the Dyck language defined in Example 12.11 

2.6. Turing machines, computable and computably enumerable languages 

One of the great accomplishments of twentieth century mathematics was the formalization 
of the idea of being "computable" . Probably the clearest model is Turing's concept of a Turing 



machine 97|, which one can consider as an idealized digital computer. Several other definitions 
were proposed in the 1930's and 1940's and all of these definitions have been shown to be equiv- 
alent. The Turing machine model of computation is the one still used in studying computational 
complexity, where one wants to investigate how difficult it is to calculate something. 

Thesis 2.9 (The Church- Turing Thesis). Any function intuitively thought to be computable is 
computable by a Turing machine. 

Seventy years of research have led to the general acceptance of the Church- Turing Thesis. By 
the word "algorithm" we therefore mean a Turing machine. 

We give a brief description of how a Turing machine works. This description is illustrated in 



Figure [31 For a careful detailed discussion see [22|, |23|, [50( • A Turing machine T consists of the 
following: 

• A tape which is divided into consecutive cells or squares and which is infinite to the right. 
Thus the Turing machine always has enough tape for any computation, that is, it has unlim- 
ited memory. There is a tape alphabet T which contains a special blank symbol b. The input 
alphabet is S C T \ {b}. Each cell contains a symbol from the tape alphabet and initially, 
all but finitely many cells contain the blank symbol b. 

• A reading head that can read and write symbols on the tape and then move one cell to the 
right or one cell to the left. Symbols L and R stand for "left" and "right", respectively. 

• A finite set Q of control states with an initial state qo G Q and a halting state H € Q. 

• A program or transition function S:QxT^QxTx {L, R}. There is only one type of 
instruction and Turing machines are thus the ultimate in "reduced instruction set architec- 
ture". If S(q, r y) = (q',j',L/R) then the machine immediately halts if q' = H. Otherwise 
the machine does the following operations in sequence: 

— replace the symbol 7 by the symbol 7', which may be the same as 7 or may be the 
blank b, 

— move the reading head one cell to the left (on L) or one cell to the right (on R), 

— assume the new state q' G Q. 

A word w G £* is written on the tape if it occupies the leftmost cells of the tape. It is 
understood that all the cells that are on the right of the cell containing the last letter of w contain 
the blank symbol b. 

Turing machines can be regarded either as calculators of functions or as enumerators. 

Definition 2.10. Let Si and £2 be finite alphabets. A function / : ' s computable if 

there exists a Turing machine T which, when started in its initial state with the reading head at 
the left end of the tape and a word w G written on the tape, eventually halts with f(w) G £2 
written on the tape. 

A set L C £* is computable if its characteristic function \l '■ £* — > {0, 1} is computable. 
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Infinite tape 
71 I 72 I • • 



b b 



Read/write head 



Finite control 




state: q 





Figure 3: Representation of a Turing machine. 



Note that a Turing machine which calculates a function is required to halt on all inputs. In 
general, a Turing machine with input alphabet £ may not halt on all inputs. 

Definition 2.11. A set L C E* is computably enumerable if there exists a Turing machine T with 
input alphabet £ such that T halts on input w if and only if w G L. We say that T enumerates 
or accepts L. 

Thus computably enumerable languages are exactly the halting sets of Turing machines. The 
following lemma is a basic fact about computability. 

Lemma 2.12. A set L CS* is computable if and only if both L and its complement = S* \ L 
are computably enumerable. 

Proof. A basic principle of constructing Turing machines is that a Turing machine T can be always 
be used as a subroutine in a larger machine T ■ If L is computable, let T compute the characteristic 
function \l of L. The machine 7~ enumerating L works as follows. On input w, the machine T 
uses T to compute xl(w). If w £ L then T halts. If w ^ L then T goes into a loop and never 
halts. The machine enumerating the complement ->L works similarly. 

Conversely, suppose that 71 and 72 enumerate L and ->L respectively. The machine T com- 
puting L uses the basic technique of "bounded simulation" . On input w, the machine T begins 
successively enumerating positive integers n. When n is enumerated, T simulates both 71 and T2 
on input w for n steps and sees if either machine halts in n steps. Since L and -iL are complements, 
exactly one of 71 or T2 will eventually halt on input w. When one of them halts, 7" then erases 
its tape and writes 1 if 71 halted and if 72 halted. □ 

Note that in order to be able to prove that a problem is not computable, it is necessary to have 
a complete list of all possible means of computation. We can assume that the input alphabet of a 
Turing machine contains the symbols and 1 . It is not difficult to effectively assign a unique binary 
number g(T) to each Turing machine T (see [Hol|). The Halting Problem for Turing machines is 
the following problem: given a Turing machine T and an input w G {0, 1}*, docs the machine 
T halt on input w? Turing [97| showed that the Halting Problem is not computable. Once one 
has a non-computable language L, one can use "reduction" to show that a language L' is not 
computable by showing that L is reducible to L' in the sense that if L' were computable then L 
would be computable. All non-computability results eventually go back to the Halting Problem. 

3. Finitely generated groups, Cayley graphs, and the Word Problem 

3.1. Labelled graphs 

A labelled graph is a triple V — (V,E, £), where V = V(T) is the set of vertices, £ is a finite 
alphabet, and E = E(T) C V x S x V is the set of oriented, labelled edges. 
Let T = (V, E, S) be a labelled graph. 

We say that T is finite if its vertex set V is finite and thus the edge set E is also finite. 

Given an edge e = (it, a, v) £ E its label is A(e) := a 6 S, its initial vertex is o(e) := u G V, 
and its terminal vertex is t(e) := v G V. We say that e is outgoing from u and ingoing into v. An 
edge e can be visualized as an arrow from o(e) to t{e). 
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For v £ V we denote by d°{v) £ [0, oo] (resp. <9'(t>) € [0, oo]) the number (possibly infinite) 
of edges outgoing from (resp. ingoing into) v. The quantity d(v) = d°(v) + d t {v) £ [0,oo] is the 
degree of v. An edge of the form (y,a,v) is called a loop at i> and is both an outgoing edge and 
an ingoing edge at v, and so contributes 2 to d(v). If d(v) < oo for all v £ V one says that T is 
locally finite. If the degrees of the vertices of Y are uniformly bounded, that is sup^gy d(v) < oo, 
one says that Y has bounded degree. 

Suppose that S is equipped with an involution a <— > a. We then say that T is symmetric if 
for each edge e = (u, a, v) £ E, the inverse edge e -1 = (v, a, it) also belongs to E. The drawing 
convention for symmetric graphs is that one draws only one directed edge (with the corresponding 
label) choosing between e and e _1 . 

Note that if T is symmetric, we clearly have d°(v) = for each v £ V. If, in addition, 

there exists d £ N such that d = d°(v) = 9*(u) for all v £ V, one says that T is regular of degree 
d. 

We say that T is deterministic if at every vertex all outgoing edges have distinct labels. 

Note that our definition of a labelled graph allows multiple edges, i.e., distinct edges of the form 
ex = {u, a±, v) and e 2 = (u, 02, v), but this implies that a\ 7^ a-i- Thus, two edges must coincide if 
they have the same initial vertex, the same terminal vertex, and the same label. 

A subgraph of Y is a labelled graph Y = (V,E, E) such that V £V ,~E £ E and E C E. 

Let T' = {V , E' , E) be another labelled graph with the same label alphabet E. A labelled graph- 
homomorphism from T to T' is a map (p: V — > V such that (tp(u), a, <p(v)) £ E' for all (u, a, v) £ E. 
A labelled graph-isomorphism from Y to T' is a bijective labelled graph-homomorphism from Y to 
r' such that the inverse map ip^ 1 : V — > V is also a labelled graph-homomorphism from V to T. 
Note that if ip is a labelled graph-isomorphism from r to V, then the map ip: E -> E 1 defined by 
ip(u, a, v) = {tp{u), a, <p{v)), for all (u, a, v) £ E, is bijective with inverse map ip -1 '■ E' E given 
by ^- 1 (u',a,w / ) = (<p- 1 ('u / ) ) a,^" 1 ( u/ ))> for a11 (u',a,v') £ E'. 

A path in r is a sequence 7r = (ei, ei, ■ ■ ■ , e n ) of edges such that o(ei + i) = t(e.i) for i = 
1,2, ...,n — 1. We extend our notation for initial and terminal vertices to paths. The vertex 
o(ir) := o(ei) is the initial vertex of it and t(jr) := t(e„) is the terminal vertex of tt. Wc then says 
that 7r starts at 0(71") and ends at i(7r), cquivalently it connects o(tt) to t(ir). An edge e £ E such 
that o(e) = £(e) is called a Zoop. For every vertex v £ V, we also allow the empty path starting 
and ending at v. 

One says that T is strongly connected provided that for all vertices u,v £ V there exists a 
path connecting u to v. If V is symmetric, the (obviously reflexive and transitive) relation in 
V defined by u ~ v provided that there exists a path in T connecting u to v is also symmetric 
and therefore an equivalence relation. Then the corresponding equivalence classes are called the 
connected components of T; clearly, Y is strongly connected if and only if there exists a unique 
such a connected component. 

Let 7r = (ei, e2, . . . , e n ) be a path. The number |-7r| — n of edges is the length of the path. The 
label of 7r is X(n) := A(ei)A(e 2 ) • • • A(e„) £ E*. The empty path has length and is labelled by the 
empty word e. If t(n) = o(n) one says that 7r is closed. If the vertices o(ei), t(ei), t(e%), . . . , t(e n ) 
are all distinct, then the path is called simple. If n is closed, contains an edge and its vertices are 
all distinct with the exception of o(ei) = t(e n ), then tt is called a cycle. 

Denote by Tl UtV (Y) the set of all paths tt in Y with initial vertex o(tt) = u and terminal vertex 
t(n) = v. More generally, given a subset F C V we set ILj^r) := Ut,e_F n u ,„(r). For u £V and 
Fcl^ we define the language 

L UtF (Y) := {A(tt) : tt g n„, F (r)} C E*. 

Note that L Ui f(Y) may be empty. 

Suppose that a given vertex vq £ V of Y is fixed as origin (or rooi or basepoint). One then says 
that r = (V,E, E,«o) is a rooted labelled graph. A rooted labelled graph-homomorphism (resp. 
rooted labelled graph-isomorphism) from a rooted labelled graph Y into a rooted labelled graph 
r' is a labelled graph-homomorphism (resp. labelled graph-isomorphism) ip: V — > V' such that 
ip(vo) = Uq, where v' Q £ V is the root of Y' . 
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Example 3.1 (The rooted infinite binary tree T2). Let E = {0, 1}. Consider the rooted labelled 
graph r = (£*,£7,E,£) where E = {(y,a,va) : v G E*,a G E}. The vertex corresponding to the 
empty word e is the root of F. Note that for every vertex v S V one has d°(v) = 2. 

The graph T is a rooted, directed tree called the rooted infinite binary tree and it is denoted 
by T2. Figure H] illustrates it . 




Figure 4: The rooted infinite binary tree T2. 



Example 3.2 (The graph underlying a finite state automaton). Let A = (Q, E, S, qo, F) be a finite 
state automaton. Consider the labelled graph T = (V, E, E) where V = Q and EcVxT^xVis 
defined by 

E = {(u, a, v) : u S V, a € £, and u £ <5(u, a)}. 

Note that F is deterministic if and only if A is deterministic. The language L(A) C E* accepted 
by A can be reinterpreted as the language consisting of all words of the form A(7r), where tt is a 
path in T starting at the initial state go and terminating at some final state in F. In symbols: 

L(A) = L qo , F (T) = {A(tt) : n e Il 9o , F (F)}. 

Presentations and Cayley Graphs 
A finitely generated group presentation is a pair (X; R), where X is a finite set of generators, 
the group alphabet is E = X U X" 1 where X -1 is a disjoint copy of X, and the set R of defining 
relators is a subset of S*. We denote by a n- a^ 1 the involutive map on £ exchanging X and 

Two words u, v S E* are said to be equivalent, written u rj u, if it is possible to transform u into 
v by a finite sequence of insertions or deletions of either the defining relators r € R or the trivial 
relators of the form is" 1 and with x £ X . The concatenation product on the free monoid E* 

(cf. Equation Q2.1JI ) induces a group structure on the set G = £*/ « of equivalence classes whose 
identity element is the class of the empty word e. Moreover, if w = a\a% • • ■ a n £ E*, the inverse of 
the class of w is the class of the element w _1 E E* defined by w^ 1 = a^ 1 ■ ■ ■ a^" 1 aj~ 1 . One says that 
(X;R) is a presentation of the group G and one writes G = (X;R). When the defining relators 
r G R are of the form r = u r v~ for some u r ,v r £ E* one often writes G — (X;u r — v r ,r € R) 
and refers to the equations u r = v r , r S R, as the defining relations. 

A presentation (X; R) where both X and the set R of relators is finite is called a finite pre- 
sentation. A group admitting a finite presentation is called finitely presentable. 

Given a presentation G = (X;R), if Fx denotes the free group based on X and N is the 
normal closure of R in Fx then the group homomorphism Fx — > G sending each x G X to its 
w-equi valence class in E* induces a group isomorphism Fx/N — > G. 
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Example 3.3. (a) Let G = {31,32, • ■ • , 3n} be a finite group where 31 is the identity element. 
The multiplication table presentation of G is the presentation 

G = (32,33, ■ • -,gn\gigj = 9k(i,j),h3 = 2,3, ...,n), 

where gkU,j) is the product of 3^ and gj determined from the multiplication table of G. This 
example shows that every finite group has a finite presentation. 

(b) In multiplicative notation, the infinite cyclic group has a presentation Z = (x) with one 
generator and no defining relations. 

(b') More generally, the free group based on a finite set X has a presentation Fx = (X) with 
generating set X and no defining relations. 

(c) In multiplicative notation, the free abelian group of rank two has a presentation 

Z 2 = (x,y; [x,y]), 

where [x, y] = x~ 1 y~ 1 xy is the commutator of x and y. 
(c') More generally, the free abelian group based on a finite set X has presentation 

(X; [x,y],x,y e X). 

In this case, the normal closure TV of R = {[x,y],x,y € X} in the free group Fx based on X 
is the commutator (or derived) subgroup of Fx- 

(d) Let G = (Z/2Z) * (Z/2Z) * (Z/2Z) * (Z/2Z) be the free product of four copies of the group Z/2Z 
with two elements. Let x, y, z and w be the nontrivial elements in each copy of Z/2Z (so that 
x = x^ 1 ,y = etc). Then the corresponding presentation is G = (x,y,z,w;x 2 ,y 2 ,z 2 ,w 2 ). 

The fundamental geometric object associated with a finitely generated group was defined by 
Cayley [H in 1878. 

Definition 3.4 (Cayley graph). Let G = (X; R) be a finitely generated group. The Cayley graph 
of G with respect to the presentation (X; R) is the labelled graph L = T(G : X; R) whose vertex 
set is V(T) = G, the set of labelled, directed edges is 

E(T) = {(g,x,gx):geG,xe^}, 

and the label alphabet is £ = IUI" 1 . 

Let r = L(G : X; R) be a Cayley graph. Then L is often regarded as a rooted graph with 
basepoint vq = 1q and is strongly connected: between any two vertices u and v there is at least 
one path from u to v. Note that a word nieE' labels a closed path in L(G : X; R) if and only if 
w represents the identity in G. Moreover, T is symmetric (with respect to the involution a > a~ x 
on S) and |X|-regular. If h £ G then the map \ih : G — > G, defined by ^(3) = hg for all 3 6 G is 
a labelled graph automorphism of L. Thus a Cayley graph is homogeneous in the sense that given 
any two vertices there is a lebelled graph automorphism taking the first vertex to the second. 

Example 3.5. (a) In Figurc[5]we illustrate the Cayley graph of the Klein 4-group Z/2Z x Z/2Z 
with respect to the multiplication table presentation (x,y,z;x 2 = y 2 = z 2 = 1, xy = z = 
yx, xz = y = zx, yz = x = zy). 

(b) The Cayley graph T(Z : x) is described in Figure [B] 
(b') The Cayley graph T(F 2 : x,y) is described in Figure [7] 

(c) The Cayley graph T(Z 2 : x,y; [x,y]) is described in Figure [5] 
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X 




Figure 5: The Cayley graph of the Klein 4-group Z/2Z X Z/2Z, with respect to the multiplication table presentation 
(x, y, z; x 2 = y 2 = z 2 = 1, xy = z = yx, xz = y = zx, yz = x = zy). 

x -2 -J^ x -l •'' > 1 •'' > X > 

Figure 6: The Cayley graph of the group Z = (x) 



yx' 1 y 2 yx' 




— 1 —1 — 9 —1 

y x y y x 



Figure 7: The Cayley graph of the free group F2 = (x, y) 

(d) The Cayley graph T(G : x,y, z,w; x 2 ,y 2 , z 2 ,w 2 ), where G = (Z/2Z)*(Z/2Z)*(Z/2Z)*(Z/2Z), 
is described in Figure [HI 

Note that the Cayley graphs in (c) and (d) are 4-regular directed trees and they are isomorphic 
as directed graphs. However they are not isomorphic as directed labelled graphs. 

Let G — (X; R) be a finitely generated presentation and let T = T(G : X; R) be the correspond- 
ing Cayley graph. When equipped with the metric dist: V xV ^ [0, 00) defined by dist(it, v) = 
min{|7r(u, v)\ : n G II Uj1 ,}, T is a discrete metric space. Denote by B n = {g e G : dist(<7, 1g) < n] 
the ball of radius n centered at 1q- The map 7 = 7(6? : X; R) : N — > N defined by 7(71) = \B n \ 
for all n £ N is called the growth function of G with respect to the given presentation. Since 7 
is subadditive (i.e. 7(71 + m) < 7(71)7(771) for all n,m € N), by a well known result of Fekete the 
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x -l y J^y ^xy 



_1 _[i _i x _ x 

a; y — > y > xy 



Figure 8: The Cayley graph of the group Z 2 = (x, y; xy = yx) 



' yw yz yx '' 




w xy 




wx wy wz 

Figure 9: The Cayley graph of the group (Z/2Z) * (Z/2Z) * (Z/2Z) * (Z/2Z) = (as, y, z, w; x 2 = y 2 = z 2 = w 2 = 1) 
limit 

A = A(G : X; R) = lim a/tWi 
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exists and 1 < A < oo. This limit is called the growth rate of G with respect to the given 
presentation, 

That A = 1 is a condition independent of the particular presentation. If G = (X'; R') is 
another finitely generated presentation of G and 7' is the corresponding growth function, then 
A' = linin^oo y/"j'{n) equals 1 if and only if A does. If A = 1 one says that the group G has 
subexponential growth. Otherwise, the group G is said to have exponential growth. All finite groups, 
all finitely generated abelian groups, and, more generally, all nilpotcnt groups have subexponential 
growth. On the other hand, if Fx = {X} is a finitely generated free group, then A = 2\X\ — 1 so 
that Fx has exponential growth if \X\ > 2. 

3.3. The Word Problem 

In a remarkable paper in 1911, twenty years before the development of the theory of computabil- 
ity, Dehn [26j posed three fundamental decision problems in group theory: the Word Problem, 
the Conjugacy Problem, and the Isomorphism Problem. (See also the expository article by de la 
Harpe [45|.) Dehn viewed the Word Problem as the following algorithmic problem: given a finitely 
generated group presentation G = (X;R) find an algorithm which, when given a word w G £*, 
decides, in a finite number of steps, whether or not w represents the identity element of G. In 
1912 Dehn [27j solved this problem for the fundamental group of a closed orientable surface: 

h 

G h = (ai,bi,aa,b 2 , ■ ■ ■ ,a h ,b h ;Y\[ai,bi\) 

i=l 

where h > 2 is the genus of the surface. 

Given a finite group presentation G = (X; n, . . . , Tk), let R be the symmetrized set generated 
by the given relators, that is, R consists of all cyclic permutations of the rj and their inverses. 
Then (X; R) is also a presentation of G. The original presentation is a Dehn presentation if every 
nontrivial word w equal to the identity in G contains a subword u such that some r G R has the 
form r = uv where \u\ > \v\. This says that every nontrivial word equal to the identity contains 
more than half of a cyclic permutation of the given relators or their inverses. 

Although we usually do not write the trivial relators, if X is a finite set and Fx is the free 
group based on X , then a Dehn presentation of Fx is given by (X; xx~ l , x~ l x, x G X). 

Now, every group admitting a Dehn presentation has solvable Word Problem. Indeed, if 
G = (X; n, . . . , ru) is a Dehn presentation, let R be the symmetrized set of relators generated by 
the ri. We then have the following algorithm, now called Dehns algorithm, to decide whether or 
not a/eS' represents the identity element in G: 

Step 1) if w = s then w does represent la, otherwise go to the next step; 

Step 2) if w contains a subword u where for some r G R, r = uv with \u\ > \v\, then replace u by 
v" 1 and go to Step 1. Otherwise, w does not represent 1q- 

Note that since each step in the algorithm strictly reduces the length of the word being considered, 
Dchn's algorithm takes only linearly many steps and thus works in linear time, which is the best 
possible complexity result. The Cayley graph of the surface group Gh,h > 2 is the dual graph 
of the regular tessellation of the hyperbolic plane by 4/i-gons. Dehn used hyperbolic geometry 
to show that the presentation of Gh given above is a Dehn presentation and thus G has solvable 
Word Problem. The quest to extend Dehn's algorithm to a larger class of groups led to the 
development of small cancellation theory which, among many other things, gives some simple 
sufficient conditions for a presentation to be a Dehn presentation. (See [92j for a survey.) This 
then led to Gromov's [40j | remarkable development of the theory of word-hyperbolic groups. As 
mentioned before, the Cayley graph of a finitely generated group becomes a metric space by 
defining the distance between two vertices as the minimal length of a path connecting them and 
considering each edge as isometric to the unit interval. The thin triangle condition then captures 
many of the features of hyperbolic geometry. One of the characterizations of a group G being 
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word-hyperbolic is exactly that it has some Dehn presentation. (See and also 0, Chapter 
HIT, Theorem 2.6.].) 

Solvability of the Word Problem was extended to all one-relator groups by Magnus [65j in 
1932. We do not, however, know any bound on the complexity of solving the Word Problem over 



the class of all one-relator groups. A theorem of Newman [87j, |63j shows that any one-relator 
presentation of the form G = {X; w n ) with n > 2 is a Dehn presentation. 

It was independently shown by Novikov jg^l in 1955 and by Boone [1] in 1958 that there exist 
finitely presented groups G = (X; R) with unsolvablc Word Problem. In order to prove this basic 
result it is necessary to code the Halting Problem for Turing machines into the Word Problem of 
the group. The unsolvability of the Word Problem is the foundation of all the unsolvability results 
in group theory and topology. 



3-4- The Dehn function 
Let 

G = (X-R) (3.1) 

be a finite presentation of G. Let £ = XUX^ 1 denote the associated group alphabet and suppose 
that w € S* satisfies w « e, that is, w = 1q in G. This is equivalent to saying that the reduced 
form of w belongs to the normal closure N of R in Fx, the free group based on X. This in turn 
is equivalent to the existence of an expression 

w = uiriu^ 1 u 2 r 2 u 2 1 ■ ■ ■ u m r m u~} (3.2) 

where m £ N, u, £ S* and r,; G % = 1, 2, . . . ,m. Then the area of w (with respect to the given 
presentation (|3.1[l ). denoted Aie&(w), is the smallest m > such that an expression of the form 
above holds. The Dehn function associated with the presentation (|3.1[) is the map Dehn: N — > N 
defined by 

Dehn(n) = max{Area(w) : w € £*,u> ps e, \w\ < n}. 
Cannon observed the following (see also [1^, Theorem 2.1]). 

Theorem 3.6. A finitely presented group presentation G = (X; R) has a computable Dehn func- 
tion if and only if the group G has solvable Word Problem. 

It is not difficult to show that if w = Iq in G then in an expression Q3.2p the length of all the 
conjugating elements Ui can be bounded by \w\. Thus if we can calculate Dehn(|w|) = b we can 
try all possible products of the form (|3.2[) with m < b and all < \w\ and check whether any of 
these products equals w in the free group. 

3.5. The Word Problem as a formal language 

Anisimov Q in 1972 introduced the fruitful idea of viewing the Word Problem as a formal 
language, a point of view which we now adopt. 

Definition 3.7. Let G = (X;R) be a finitely generated group presentation. The Word Problem 
of G, relative to the given presentation, is the language 

WP(G : X; R) = {w E S* : w e}, 

where S is the group alphabet as usual. One says that the G has regular (resp. context-free, resp. 
computable) Word Problem with respect to the given presentation if WP(G : X; R) is a regular 
(resp. context-free, resp. computable) language. 

Note that, the Word Problem for a finitely generated group presentation G — (X; R) is solvable 
(in the sense of Dehn) if and only if the language WP(G : X;R) C S* is computable. 
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Observation 3.8 (Invariance and finitely generated subgroups). It is easy to see that the clas- 
sification above of the Word Problem as a formal language is actually a property of the group 
and does not depend on the particular presentation considered. Indeed, the complexity of the 
Word Problem of a finitely generated group bounds the complexity of the Word Problems of all 
its finitely generated subgroups. For, suppose that G = (X; R) has a Word Problem of a given 
type and that H = (Y; S) is a finitely generated presentation of a group isomorphic to a finitely 
generated subgroup of G. Let <f> '■ H — > G be an injcctive homomorphism and for y G Y denote by 
w y G £* a representative of the image <f>(y). So, whether a finite automaton, pushdown automaton 
or Turing machine 371 accepts the Word Problem for the first presentation, we can construct a 
machine 371' of the same type which, on reading a letter (y) ±x G fU Y^ 1 simulates the sequence 
of transitions of 50? on reading the word (w J ,) ±1 G £*. 

As a consequence, we say that a finitely generated group G is context-free provided that the 
Word Problem WP(G : X; R) relative to some (equivalently, every) finitely generated presentation 
G = (X;R) is context-free. 

Anisimov Q characterized groups with regular Word Problem. 

Theorem 3.9 (Anisimov). Let G — (X;R) be a finitely generated group. Then G has regular 
Word Problem if and only if G is finite. 

Proof. Suppose that G is finite. Consider the deterministic finite automaton A = (Q,T,,S,qo,F) 
where Q = G, 1q is both the initial state go and the unique element in F, and i5:QxE-^ V(Q) 
is given by 



for all q G Q and a G A. Note that the graph underlying A (cf. Example 13. 2[) is the Cayley graph 
T(G; X; R). Then A accepts exactly the Word Problem of G. 

Conversely, if G is infinite there are arbitrarily long words w G £* such that no nontrivial 
subword of w is equal to the identity in G. Suppose that A = (Q, S, 5, go, F) is a deterministic 
finite automaton with alphabet the group alphabet £ and let n denote the cardinality of its state 
set Q. Taking a word w as above and such that \w\ > n + 1, then there exist q G Q and words 
wi, W2, u>3 G S* satisfying w = W1W2W3 with W2 7^ £ such that A, when reading w, is in the same 
state q after reading the initial segment w\ and after reading w\Wq,. Then if A is in the state 
q' G Q after reading wiw^ 1 it is in the same state after reading W1W2W1 1 . But the first word 
equals the identity in G while the second word does not. It follows that A cannot accept the Word 
Problem of G. □ 

3.6. Context-free groups 

Recall that a group is context-free if it is finitely generated and its Word Problem with respect 
to some (equivalently, every) finitely generated presentation is a context-free language. 

Example 3.10 (The Word Problem for the free group). Let X be a finite set and let G = Fx be 
the free group based on X. Recall from Example 12.31 that G is in one-to-one correspondence with 
the set L re( j(S) of all reduced words over the alphabet E = XUX^ 1 . We adopt the convention that 
e = e. Consider the one-state deterministic pushdown automaton M. = ({go}, S, S, S, go, {go}, e)- 
The automaton starts with empty stack, accepts by empty stack, and the transition function is 
defined by 



for all a, z G E. It is clear that L(M) = WP(G : X; R), so the Word Problem for G is context-free. 
It follows that free groups are context-free. 

For the next example we need the following well-known result (see (Hoi . Lemma 6.1]). 



<5(g, a) = qa 
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Lemma 3.11 (The Pumping Lemma for context-free languages). Let X be a finite alphabet. Let 
L C S* be a context-free language. Then there exists a positive integer N = N(L) such that if 
w G L and \w\ > N , then we can find u, v, z,s,t € S* such that w — uvzst, \v\ + \s\ > 1, \vzs\ < N 
and uv n zx n y G L for all n > 0. 

With the notation from the above lemma, we say that the word uv n zs n t is obtained from w 
by pumping the subwords v and s. 

Example 3.12 (The Word Problem for the free abelian group of rank 2). Let G = Z 2 with 
presentation (x,y; [x,y]). Then x m y m x~ n y~ n = 1 in G if and only if m = n. We can now use 
Lemma 13.111 to show that L = WP(G : X;R) is not context-free. Suppose by contradiction 
that L is context-free and let N = N(L) be the corresponding positive integer. Consider the 
word w = x N+1 y N+1 x~( N+1 ^y~( N+1 \ We clearly have w G L. However, there are no subwords 
u, v, z, s, t of w satisfying the conditions described in the Pumping Lemma. Indeed, from \vzs\ < N 
we deduce that vzs is a subword of one of the following forms: (i) x m , (ii) x p y q , (in) y h x~ k , (iv) 
x~ p y~ q , or (v) y~ m , for suitable positive integers m,p,q,h and k. In all these cases, by pumping 
n > 2 times the subwords v and s, we obtain a word w' whose number of positive occurrences of 
x or of y fails to equal the number of its negative occurrences so that w' L, contradicting the 
Pumping Lemma. It follows that L is not context-free. Therefore Z 2 is not a context-free group. 

Proposition 3.13. Let G be a finitely generated group and H a subgroup with [G : H] < oo. 
Then G is context-free if and only if H is context-free. 

Proof. We have already seen the "only if" part in Observation 13.81 Conversely, let H be a finite 
index subgroup of G and suppose that it is context-free. Recall the following general fact from 
group theory (sometimes called the Poincare Lemma): a subgroup of finite index in a finitely 
generated group G contains a subgroup which is normal in G and also of finite index, and which 
is therefore finitely generated. Thus this normal subgroup is also context-free if the ambient 
subgroup is context-free. We can therefore suppose that H is normal in G. Let K = G/H be 
the corresponding finite quotient with ip: G — s> K the natural quotient map. Let K = {k\ = 
1, k2, ■ ■ ■ , k n }. Let H = (hi, h2, ■ . ■ , h m : R) be a presentation of H. Since H is normal in G, if 
hi G G is such that ?A(^i) = ki we bave relations of the form 

krhJjk' 1 = w(r,j,rf) 

where rj = ±1 and w(r,j,r]) is a word in the generators hi and their inverses. Because H is a 
normal subgroup we also have the relations 

k r k s — z(r, s)kj.( r s j 

where z(r, s) is a word in the generators hi and their inverses determined by the relation k r k s = 
kt( r ,s) m the multiplication table of K. So a presentation of G is 

G = (fc 2 , ■ • ■ , k n ,hi,h 2 , • ■ • , h m ; k~ x h^kr = w(r,j, n),k r k s = z(r, s)k t ^ !S ),R), 

where r, s = 2, . . . , n, j = 1, 2, . . . , m, and rj = ±1. 

Let M. be a pushdown automaton accepting the Word Problem of H for its presentation above. 
The idea of constructing a pushdown automaton M to accept the Word Problem of G for the 
above presentation is very simple. On reading a word w, the automaton A4 uses extra master 
control states to keep track of the image ip(w) in K and uses the stack to simulate M. on the 
Word Problem of H . The automaton M. starts with empty stack in the master control state 
corresponding to \k- If M. is in the master control state corresponding to k r and A4 reads a letter 
k s it uses a sequence of auxiliary states to simulate M. reading the word z(r, s) and then changes 
to the master control state corresponding to k t where k t = k r k s in K . If M. reads a letter hJj while 
in the master control state corresponding to k r in the quotient group it uses a series of auxiliary 
states to simulate M. reading the word w(r,j,rf). Finally, M. accepts by having empty stack and 
master control state corresponding to Ik- □ 
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Definition 3.14. A group G is virtually free if G contains a free subgroup H of finite index in G. 



Corollary 3.15. A finitely generated virtually free group is context-free. 

Proof. Let G be a finitely generated virtually-free group and let H C G be a free subgroup of 
finite index. Then H is finitely generated and, as seen in Example l3.10l context-free. By the "if" 
part of the previous proposition, we have that G is context-free as well. □ 

Muller and Schupp fT9| proved the following characterization of groups with context-free Word 
Problem. 

Theorem 3.16 (Mullcr-Schupp). Let G be a finitely generated group. Then G is context-free if 
and only if G is virtually free. 

Remark 3.17. In [2l[ Ceccherini-Silberstein and Woess introduced and studied the concept of a 
context-free pair of group. Such a pair (G,K) consists of a finitely generated group G = (X;R) 
together with a subgroup K C G for which the language consisting of all words over S* = XUX^ 1 
representing an element in K is context-free. (When K reduces to the identity element, this clearly 
specializes to the abo ve de finition of G to be a context-free group.) These investigations were 



extended by Woess in 102| who applied them to the study of random walk asymptotics yielding 



a complete proof of the local limit theorem for return probabilities on any context-free group. 

3.7. Subgroups and embeddability 

We briefly mention some applications of formal language theory to subgroups and embeddabil- 
ity. 

Definition 3.18. Let G = (X; R) be a finitely generated group with group alphabet S = XUX^ 1 . 
Let ip : E* — > G be the natural map. Let S C G be a subset. An enumeration of S is a subset 
L C S* such that tp(L) = S. Then one says that L is a regular (resp. context-free, resp. 
computable) enumeration provided that L is a regular (resp. context-free, resp. computably 
enumerable) language. 

Anisimov and Seifert Q proved in 1975 the following theorem. 

Theorem 3.19 (Anisimov-Scifcrt). Let G be a finitely generated group and let H C G be a 
subgroup of G. Then H is finitely generated if and only if H has a regular enumeration. 

Anisimov and Seifert also proved that context-free groups are finitely presentable, a fact used 
in the proof of the characterization theorem. The following more general result is due to Frougny, 
Sakarovitch, and Schupp (33^ . 

Theorem 3.20 (Frougny-Sakarovitch-Schupp). Let G be a finitely generated group and let N C G 
be a normal subgroup of G. Then N is finitely generated as a normal subgroup (that is, N equals 
the normal closure of a finite set of elements of G) if and only if N has a context-free enumeration. 

Definition 3.21. A computably enumerable presentation (also called a recursive presentation) is 
a group presentation G = (X;R) where the set X of generators is finite and the set R of defining 
relators is computably enumerable. 

Recall that a group H is said to be embeddable into a group G provided there exists an injective 
homomorphism ip: H — ► G. The remarkable Higman Embedding Theorem [48| shows that the 
connection between group theory and computability is intrinsic. 

Theorem 3.22 (Higman). A finitely generated group H is embeddable into some finitely presented 
group if and only if H admits a computably enumerable presentation. 
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3.8. Basic groups and simple languages 

We next consider a special subclass of deterministic context-free languages. 

Definition 3.23. Let £ be a finite alphabet. A language L C S* is called simple if it is accepted 
by a 1-state deterministic pushdown automaton which accepts by empty stack and is required to 
halt when it empties its stack. 

The convention that the automaton accepting a simple language halts on empty stack makes 
a simple language L prefix-free, that is, if w = uv £ L with u and v nontrivial then u L. The 
main reference for simple languages is Harrison [46| . 

Recall that given a language L C X*. the Kleene star of L is the language L* over £ defined 

by 

L* = {wiW2 ■ ■ • w n : Wi € L where i = 1, 2, . . . , n and n = 0, 1, 2, . . .}. (3-3) 

In other words, L* is the submonoid of S* generated by L. 

Since we are interested in groups, the convention that the automaton must halt on empty 
stack is rather unnatural. Note that the language accepted by a 1-state deterministic pushdown 
automaton which is not required to halt on empty stack is the Kleene star L* , of a simple language 
L (see Equation (|3.3p L 

Example 3.24. We show that the Word Problem for a finite group with respect to its multipli- 
cation table presentation is the Kleene star of a simple language. Let G = {gi,<?2, ■ ■ • , <?«} (with 
gi = 1q) be a finite group and consider its multiplication tabic presentation 

G = (ffa, • • • j 9n',9i9j = 9k(i,j)} 

(see Example [331(a)). Let M be the deterministic single state pushdown automaton whose input 
alphabet and stack alphabet are the set {g2, ■ ■ ■ , g n } of non-identity elements of G. The automaton 
M. starts with empty stack and will always have at most one symbol on the stack. If the stack 
is empty and M. reads gi then A4 puts gi on the stack. If the symbol on the stack is gi and A4 
reads gj then M. replaces gi by the product 9k(i,j) if 9%9j is not the identity of G and M. empties 
the stack otherwise. It is clear that M has empty stack exactly when the product of the elements 
it has read so far is the identity, so L(A4)* = WP(G : g2, ■ ■ ■ ,g n ] gigj = 9k(i,j))- 

Definition 3.25. A group G is called basic if it is the free product of finitely many finite groups 
and a free group of finite rank, i.e, G = G\ * G-x * ■ • • * Gk * F n , where Gi is a finite group, 
i = 1, 2, . . . , k, and F n is the free group of rank n, with fc, n > 0. 

Note that finite groups and finitely generated free groups are basic groups. We saw in Example 
I3.24l that the Word Problem of a finite group with respect to the multiplication table presentation is 
the star of a simple language. Analogously, it follows from Example 13.101 that the Word Problem 
of a finitely generated free group with respect to the free presentation is the star of a simple 
language as well. More generally if we take the "canonical presentation" of a basic group given 
by the disjoint union of the multiplication table presentations of the finite factors and the free 
presentation of the free group, then the corresponding Word Problem is the star of a simple 
language. In general, however, having a Word Problem which is the Kleene star of a simple 
language depends on the given presentation. We give an example below (Example I3.29|) . 

Haring-Smith [43| characterized groups whose Word Problem is the star of a simple language. 

Theorem 3.26 (Haring-Smith). A finitely generated group G is basic if and only if it has a finitely 
generated presentation G = (X;R) such that the corresponding Word Problem is the Kleene star 
of a simple language. 

Haring-Smith [43[ also gave the following geometric characterization of basic groups. 

Theorem 3.27 (Haring-Smith). A group G is basic if and only if G has a finitely generated 
presentation such that in the corresponding Cayley graph T the following holds: for every vertex 
v e V^r) there are only finitely many cycles trough v. 
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Indeed, the Word Problem for a given presentation is the star of a simple language if and only 
if its Cayley graph satisfies the above geometric condition. 

Example 3.28 (The modular group). A presentation of the modular group G = PSL(2,Z) = 
(Z/2Z)*(Z/3Z) is G = (x,y;x 2 ,y 3 ). The corresponding Cayley graph T is represented in Figure[TUJ 
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Figure 10: The Cayley graph of the modular group Z/2Z * Z/3Z = (x, y; x 2 = = 1) 

As illustrated in Figure [TU1 for every vertex v <E V(T) there are exactly two cycles through 
v, namely (ei,e 2 ,e 3 ) and (e 3 _1 , e 2 _1 , ei" 1 ), where e\ = (v,y,vy), e 2 = (vy,y,vy 2 ), and e 3 = 
(vy 2 , y, v). As usual, for an edge e we denote by e _1 the opposite edge (see the drawing convention 
for symmetric labelled graphs at page [T2"|) . 

Example 3.29. Consider the presentation (x, y;y = x 2 ) of the infinite cyclic group. In the Cayley 
graph of this presentation there are infinitely many cycles through a vertex (see Figure lllj) and 
the Word Problem for this presentation is not the star of a simple language. 




4. Finitely generated graphs and ends 

4-1- Finitely generated graphs 

We need a framework in which we can discuss both Cayley graphs of finitely generated groups 
and complete transition graphs of pushdown automata. The following definition is from [80( . 

Definition 4.1. A finitely generated graph is a rooted labelled graph V = (V,E,'E,vo) with a 
uniform upper bound on the degrees of vertices, and which is connected from vq, that is, for every 
vertex v £ V, there is a directed path from vq to v. 
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The Cayley graph of a finitely generated group is clearly a finitely generated graph. Other 
examples of finitely generated graphs are provided by the complete transition graph of pushdown 
automata that we now define. 

Definition 4.2 (The complete transition graph of a pushdown automaton). Let M = (Q, E, Z, 6, 
go, F,zq) be a pushdown automaton. The complete transition graph of M. is the labelled graph 
r = Y(A4) defined as follows. The initial vertex is the initial configuration (qo,Zo). The vertex 
set V is the subset of Q x Z* consisting of all configurations (g, £) which are reachable from the 
initial configuration on reading some possible input w S E* . In our previous notation, 

v = {( q ,0-Mh(q,0,weJ:*}. 

w 

If v = (q,C) an d v 1 = (q',C) are two vertices, then there is an oriented edge labelled by a G E 
from v to v' if and only if £ = £qz with z G Z such that there exists (g', Ci) € S(q, a, z) satisfying 

C = CoCi- 

Note that Y(A4) is connected from vq by definition and that there is an upper bound on the 
degrees of vertices. Thus the complete transition graph Y(A4) of a pushdown automaton A4 is a 
finitely generated graph. 

Example 4.3. Consider the deterministic pushdown automaton M = (Q, E, Z, S, go, F, zq) with 
Q = F = {go}, E = Z = {0,1}, zo = e, and transition function defined by 

S(q ,a,z) = (q ,za) 

for all a, z 6 {0,1}. Then the associated complete transition graph of A4 is isomorphic to the 
rooted infinite binary tree T 2 (see Figure HJ) ■ 

4-2. Ends of finitely generated graphs 

Let F = (V, E, E, vq) be a finitely generated graph. Intuitively, an end of F is a way to "go to 
infinity" in T. Although a finitely generated graph is a directed graph, in order to discuss ends, we 
need to consider undirected paths. Let V be the graph obtained by considering T as an undirected 
graph. So if (u, a, v) is an edge of Y then both (u,v) and (v,u) are edges of Y'. In short, one 
now ignores labels and the orientation of edges. An undirected path in Y is a sequence of edges 
(ui,vx), (vi,v 2 ), ■ ■ ■ , (vi,v l+ i), . . . ,{v n ,v n+1 ) forming a path in Y' . 

For a non-negative integer n, we denote by Y n the subgraph of Y whose vertex set V n consists 
of all vertices v £ V such that there exists an undirected path tt with 1 7r | < n from the origin ^o to 
v and whose edge set consists of the edges of Y between two such vertices. r„ is called the ball of 
radius n centered at the basepoint Vq of Y. 

Let n be a non-negative integer. It follows from the finiteness of the degrees of the vertices 
of r that there are only finitely many connected components of Y \ Y„ . Let us denote them by 
Y n ,i, Y n _2, • ■ • , Y n u n ). Let e(n) be the number of infinite connected components of Y \ Y n . Note 
that < e(n) < k(n). Moreover, it is easy to see that e(n) is a non-decreasing function of n. Thus 
the following limit exists in RU {oo}: 

e(r) = lim e(n). 

n— >oo 

It is called the number of ends of Y. 

Example 4.4. (a) Let Y be a finite graph and fix an arbitrary vertex vq £ ^(r). For every 
n > one has Y \ Y n is finite and, in particular, has no infinite connected components, that 
is, e(n) = 0. It follows that e(F) = 0. 

(b) Let r = T2 be the rooted infinite binary tree. Then for every non-negative integer n, the 
vertex set of the ball of radius n centered at vq = s consists of all words in {0, 1}* having 
length at most n. Each connected component of r\T„ has vertex subset V w C V consisting of 
all words in {0,1}* with proper prefix w, where w£ {0,1}" is a word of length n. Since there 
are 2™ distinct words of length n over the alphabet {0, 1}, we have e(n) = 2 n for all n > 0, so 
that e(r) = co. 
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(c) Let r be the Cayley graph of the infinite cyclic group Z = (x) . Then for every non- negative 
integer n the ball of radius n centered at Vq = 1% is the "interval" from x~ n to x n . Thus, 
r \ r„ consists of the two disjoint intervals C< n = {x m : m < —n} and C> n = {x m : m > n}. 
Thus e(n) = 2 for all n > 1 so that e(T) = 2. 

(d) Let r be the Cayley graph of Z 2 with respect to the presentation Z 2 = (x,y; [x,y]). Then 
for every non-negative integer n the ball of radius n centered at the origin is the "square" 
r n = {x p y q : \p\ + \q\ < n}. Thus, T \ T n consists of a single connected component, namely 
C>„ = {x p y q : \p\ + \q\ > n}. Hence e{n) = 1 for all n > so that e(r) = 1. 

Remark 4.5. If G is a finitely generated group then the number of ends of the Cayley graph 
of any finitely generated presentation of G is the same. Thus e(G), the number of ends of G, is 
well defined and does not depend on the presentation. It is a fact that the number of ends of any 
finitely generated group is either 1 1 5 _2, or o o. W e also remark that if e(G) = oo then G contains 



Theorem 4.6 (Stallings). Let G be a finitely generated group. Then e(G) > 1 if and only if one 
of the following holds: 

• G admits a splitting G = H *c K as a free product with amalgamation, where C is a finite 
proper subgroup of both H and K ; 

• G admits a splitting G = {H,t;tC\t~ l = C2) as an HNN- extension, where C\ and Ci are 
isomorphic finite subgroups of H . 

The proof of the characterization of context-free groups as finitely generated virtually free 
groups depends heavily on the Stallings Structure Theorem. A consequence of the geometric 
characterization of context-free groups is that every finitely generated subgroup of a context-free 
group is either finite or has more than one end. This opens the way to a proof by induction but 
needs the notion of accessibility. A finitely generated group is accessible if the process of taking 
repeated splittings as in Stallings' theorem must halt after a finite number of steps. That is, one 
splits G as H*cK or as an HNN-cxtcnsion (H, t : tGit" 1 = C2) according to the theorem and then 
splits H and K or just H in the HNN case, etc. Accessibility of context-free groups is needed to 
complete the characterization of context-free groups as virtually-free groups. (See Thcorcm l3.160 

Senizergues [95| proved the following result. 

Theorem 4.7 (Senizergues). If G is a context-free group then there are only finitely many con- 
jugacy classes of finite subgroups ofG. 

Linnell [62| proved that any finitely generated group with only finitely many conjugacy classes 
of finite subgroups is accessible. In conjunction with Senizergues' theorem this shows that any 
context-free group is accessible. Dunwoody [28| later proved that all finitely presentable groups 
are accessible. Recall that Anisimov and Seifert proved that context-free groups are finitely pre- 
sentable, (See the comments after Theorem 13. 191 Note that there exist finitely generated groups 
that are not accessible (see (29[). 

4- 3. Graphs with finitary end structure 

We have seen that e(Z 2 ) = 1 while e(T2) = 00. Later, in the section on monadic logic, we shall 
see that there is a precise sense in which, from the point of view of logical complexity, the Cayley 
graph of Z 2 is infinitely more complicated than the rooted infinite binary tree T2. So the number 
of ends is not a good measure of logical complexity but it turns out that we can still use ends to 
measure complexity. 




A very powerful result of Stallings [96| is the Stallings Structure Theorem. 
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Definition 4.8. Let T be a finitely generated graph. Denote by c(T) the number of end- 
isomorphism classes of connected components of T \ T n over all components and all n > 1. 
An end-isomorphism between connected components C of T \ T n and C of T \ T n > is a labelled 
graph isomorphism which additionally maps the points of r„ at distance n from vq to the points of 
T n i at distance n' from vq (thus respecting the end structure). Note that although we undirected 
the graph to define the connected components, we are using the directed structure of T to define 
end- isomorphisms. 

Example 4.9. (compare with Example 14.41) . 

(a) Let r be a finite graph. The number of all connected components of L \ L„, n > 1, equals the 
number of all connected components of r\ri,L\L2, . . . ,L\L,i_i, where d — max{dist(u, vq) : 
i' G y(L)}, and is therefore finite. It follows that c(L) < oo. 

(b) Let L = T2 be the rooted infinite binary tree, say with label on left successor edges and label 
1 on right successor edges. Then for every n S N and every component C of L \ L„ the graph 
C is a rooted infinite binary tree isomorphic to L. Thus c(L) = 1. 

(c) Let r be the Cayley graph of Z with respect to the standard presentation. Recall that L is 
the infinite line (see Figure [5]) with a directed edge labelled by x from vertex x n to vertex 
x n+i f or a jj n g If we remove a ball T r ,r > 1, then there are always two components. Call 
these components the "left" component and the "right " component. These two components 
are not isomorphic as labelled graphs since edges with label x go from vertex x n to vertex 
x n+1 . However, all right components are isomorphic to each other and all left components are 
isomorphic to each other. Thus c(F) = 2. 

(d) Let r be the Cayley graph of Z 2 with presentation (x,y;[x,y\) (see Figured]). Then, for 
every non-negative integer n the ball of radius n centered at the identity is the "square" 
r n = {x p y q : \p\ + \q\ < n}. It is clear that the graphs r \ T n are pairwise non-isomorphic 
(look at the finite boundaries!) so that c(T) = 00. 

Definition 4.10. A finitely generated graph T has finitary end-structure if c(T) < 00. A finitely 
generated graph is context-free if there exists a pushdown automaton M. such that T is label- 
isomorphic to T(M). 

It turns out that there is a characterization of finitely generated graphs with finitary end- 
structure. 

Theorem 4.11 (Mullcr-Schupp). Let T be a finitely generated graph. Then T has finitary end- 
structure if and only if T is context-free. 

The necessary condition of the theorem is the "easy part" while the sufficient condition is 
"hard". An analys is of the proof shows that finitely generated graphs T with c(T) < 00 are "very 



treelike" (see also [101[). Indeed, T contains a rational subtree of finite index in the sense that 
there is a subtree T of T defined by a finite automaton such that every vertex of T is within a 
fixed distance from some vertex of T. Putting the characterization of graphs T with c(T) < 00 
together with the characterization of context-free groups we have the following result. 

Corollary 4.12. Let G be a finitely generated group and let T be the Cayley graph of any finitely 
generated presentation of G. Then c(T) < go if and only if G is virtually free. 



5. Second-order monadic logic, the Domino Problem, and decidability 

5.1. Second-order monadic logic and the theorems of Biichi and Rabin 

The reader is probably familiar with first- order logic in which the quantifiers 3 (there exists) 
and V (for all) range only over individual elements of a given structure. The first-order language 
for a structure includes the quantifiers, variables x, y, z, . . . for individual elements and the Boolean 
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connectives -> (negation), V (or), and A (and). There are function and relation symbols for the 
operations and relations of the structure, including; the relation of equality. For more on first-order 
logic see the monograph by Enderton [30| . 

Example 5.1 (Group axioms). The usual axioms which define a group are expressible in first- 
order logic. A quadruple (G, , 1g), where G is a set with a binary function symbol *, a unary 
function symbol , and a 0-ary constant symbol lg, defines a group provided that: 

• \/xiy\/z[{x * y) * z = x * (y * z)\ (associative property); 

• \/x[x * 1g = 1q * x = x] (existence of an identity element); 

• \/x[x * x" 1 = x~ x * x = 1q] (existence of inverse elements). 

In monadic second-order logic, one also has variables and quantifiers ranging over arbitrary 
subsets of the structure. The term "monadic" refers to the fact that we can quantify only over 
subsets of the given structure, and not over relations. Second-order logic with variables for ar- 
bitrary relations is sometimes called full second-order logic to distinguish it from the monadic 
version. 

Example 5.2 (Pcano axioms). Consider the language of second-order Peano axioms for arithmetic 
in which we have a unary function symbol s for the successor function, a constant symbol 0, the 
set membership symbol G, the relation C of set inclusion, and equality relation for both individual 
and set variables. The axioms are: 

• V.t-i[s(.t) = 0] 

• \/y3x[y 7^ => y = s(x)] 

• \/x\fy[s(x) = s(y) =>• x = y] 

• VA[[0 G X A Vx(x el4 s(x) e X)] => \/y[y 6 X]} (mathematical induction). 

In standard second-order logic, these axioms define N with the successor function up to iso- 
morphism. This theory is sometimes denoted by 515, the theory of one successor function. 



Biichi [ll[ introduced the theory of finite automata on infinite inputs to prove the following 
result. 

Theorem 5.3 (Biichi). The monadic second-order theory SIS is decidable. 

We next want to consider the monadic theory S2S of two successor functions, that is, the 
monadic theory of the rooted infinite binary tree T-i. Individual variables and quantifiers can 
actually be eliminated since when a set has exactly one element is definable in the logic and we 
often adopt this point of view. Also, equality between sets is definable in terms of set inclusion. 
The set of vertices of the rooted infinite binary tree T% can be viewed as the set {0, 1}* of all finite 
words on {0, 1}. We have a constant for the root of the tree (which corresponds to the empty 
word e) and two set- valued successor functions, and 1. If S denotes a set of vertices then 

50 = {v0 :veS} and 51 = {vl : v e S}. 

We also have the binary relation C of set inclusion. 

In 1969 Rabin [89[ developed the theory of finite automata working on infinite trees and proved 
the following result. 

Theorem 5.4 (Rabin). The monadic second-order theory 525 is decidable. 

As a consequence of Rabin's theorem, the monadic second-order theory SnS of n successor 
functions is also decidable since it can be interpreted in 525. Note that the above theories are 
about the geometry of the underlying graph. Analogously then, we can define the second-order 
monadic theory of any finitely generated graph T = (V, E, E, vq). We thus have again a constant 
for the origin of the graph vq and for each a € E we have a set-valued successor function where 
Sa = {v E V : 3u e S such that (u, a, v) G E} for all 5 C V. 
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5.2. The Domino Problem 

Rabin's theorem is one of the most remarkable positive results on decidability. An important 
negative result is the unsolvability of the Wang Domino Problem in the plane. Whether or not it 
is possible to tile the plane with copies of a fixed finite set of square tiles with colored edges was a 
question raised by Wang [98| in the late 1950s. Of course, when one places a tile next to another 
one, the colors on the matching edges must be the same. Wang showed that the origin- constrained 
problem is undecidable. In this version there is a fixed initial tile which must be used first. Indeed, 
fixing one tile is enough to show that one can directly simulate the Halting Problem for Turing 
machines in this context. Given a Turing machine T one can write down a set of tiles such that 
one can tile the entire plane if and only if T halts when started with a blank tape. The general 
Tiling Problem without an origin constraint was proved undecidable by Berger 0] in 1966. In 
1971, Robinson [9(| found a simpler proof of the undecidability of the general problem in the 
Euclidean plane. 

This problem can be reformulated in terms of coloring vertices as follows. Let T be the Cayley 
graph of the standard presentation Z 2 = (x,y;[x,y]} of the free abelian group of rank 2. Let 
C = {ci, C2, . . . , Cfc} be a finite set of colors. The standard neighborhood of a vertex v in T consists 
of v and its four neighbors: vx,vx~ l ,vy, and vy~ l (sec Figure [T2"]) . 



vy 
v 



vx 



-i — > V — > vx 



vy 



Figure 12: The standard neighborhood of a vertex v in the Cayley graph of 7? = (x,y\ \x,y\). 

We are also given a set J- of forbidden patterns where a pattern p £ C 5 is a coloring of the 
vertices of the standard neighborhood with colors from C. The Domino Problem for Z 2 is the 
following decision problem: given a pair (C, J-) as above, can all the vertices of the Cayley graph 
r be colored so that there are no forbidden patterns? Note that since Y can be viewed as the 
dual graph of the tessellation by squares, this version is easily seen to be equivalent to the original 
formulation in terms of square tiles. 

Our reformulation of the Domino Problem applies to an arbitrary finitely generated group 
G. Also, the Domino Problem is easily expressible in terms of the monadic second-order logic 
of the Cayley graph Y of G with respect to the given presentation. A tuple (Ci, C2, . . . , Cfc) of 
sets of elements of G is a disjoint cover of G if every element of G belongs to exactly one of the 
Ci. (A disjoint cover differs from a partition only in that some of the Ci may be empty.) We 
need only say that there is a disjoint cover (Ci, C2, ■ ■ ■ , Ck) of the vertices corresponding to the 
colors ci,C2,...,Cfc such that there are no forbidden patterns. For example, if the i-ih pattern 
in T centered at v has color c v at v and colors c x ,c x -i,c y , and c y -i at vx, vx^ 1 , vy, and vy^ 1 
respectively, we abbreviate this as pi, and we must say that such a pattern does not occur. We 
can write this as: 

3Ci3C a • • ■ 3C k Vv[[\/[v e C t ] A{/\[veG\^v<£ C ]} A [ f\ -, Pi )}. 

i i<j Pi6-F 

Note that from the point of view of logical complexity, measured in terms of alternation of 
quantifiers, the sentence above is very simple. It consists of one block of existential set quantifiers 
followed by one universal individual quantifier and such sentences are already undecidable. There 
is thus a precise sense in which the monadic logic of the Cayley graph of Z 2 is infinitely more 
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complicated than the monadic logic of the infinite binary tree, where the entire monadic theory is 
decidable. 



Recently, Margenstern [68( (see also [67[ for a shorter account) proved that the general Tiling 
Problem of the hyperbolic plane is undecidablc by using a regular polygon as the basic shape of 
the tiles. Robinson raised this problem in the above mentioned paper and in 1978 he proved that 
the origin-constrained problem is undecidablc for the hyperbolic plane [9l| . The fundamental 
group of a closed orientable surface of genus 2 has a presentation G2 = {a, b, c, d; [a, b][c, d]). The 
corresponding Cayley graph induces a tessellation of the hyperbolic plane by regular octagons and 
every vertex is on exactly eight such octagons (thus the graph is self-dual). We can reformulate 
Margenstern's undecidability result in group-theoretical language as follows. 

Theorem 5.5 (Margenstern). The Domino Problem for the surface group G2 is undecidablc 

5.3. Decidability of monadic second-order theory for context-free groups 

Recall that a finitely generated group G has context-free Word Problem if and only if G is 
virtually free (see Theorem I3.16[) . Now the Cayley graph of a finitely generated virtually free 
group has a regular tree of finite index. Namely, the subgraph corresponding to the Cayley graph 
of the free subgroup of finite index. In this case one can reduce the monadic theory of G to the 
monadic theory of the subtree. As a consequence, we have the following result [78| . 

Theorem 5.6 (Mullcr-Schupp). The monadic second-order theory of a Cayley graph of a context- 
free group is decidable. 

Corollary 5.7. The Domino Problem for context-free groups is decidable. 
Kuske and Lohrey [60( have recently proved the converse to Theorem 



Theorem 5.8 (Kuske-Lohrcy). If the monadic second-order theory of a Cayley graph of a finitely 
generated group is decidable, then the group is context-free. 

In the section on graphs with unitary end structure, we mentioned that all such graphs also 
have a regular subtree of finite index. Thus we have the following result from (78| . 

Theorem 5.9 (Muller-Schupp). Let T be the complete transition graph of a pushdown automaton. 
Then the monadic second-order theory ofT is decidable. 



6. Cellular Automata on Groups 

Cellular automata were introduced by von Neumann [I2I |86| who used them to describe theo- 
retical models of self-reproducing machines. Although originally defined on the lattice of integer 
points in Euclidean plane, cellular automata can be defined over any group. 

Let G be a group, called the universe, and let K be a finite alphabet called the set of states (or 
colors). Denote by E the set of all maps a: G — >• E, called configurations. When equipped with 
the prodiscrete topology, that is, the product topology obtained by taking the discrete topology 
on each factor E of E G = ElgeG ^' ^ ne configuration space becomes a compact, Hausdorff, totally 
disconnected topological space. There is a natural continuous left action of G on E G given by 
ga(h) = a(g~ 1 h) for all g,h £ G and a £ E G . This action is called the G- shift on E G . 

Definition 6.1. A map C: E G — s> E G is called a cellular automaton provided there exists a finite 
subset McG and a map fx: E M — > E such that 

C{a){g)=ii{{g- 1 a)\ M ) (6.1) 

for all a £ E G and g £ G, where (-)|a/ denotes the restriction to M. The subset M C G is called 
a local neighborhood (or memory set) for C and \i is the associated local defining map. 
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Example 6.2 (The majority action on Z). Consider G = Z, E = {0,1}, M = { — 1,0,1} and 
H : E M = E 3 E denned by 



iu(a_i,oo,ai) = 



1 if a_i + ao + a i > 2 
otherwise. 



Figure [13] illustrates the behavior of the corresponding cellular automaton C: E z — > E z . Note that 
C is surjective but not injective. 
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Figure 13: The cellular automaton denned by the majority action on Z. 



Example 6.3 (Hedlund's marker jig). Let G = Z, E = {0,1}, M = {-1,0,1,2} and fj,: E M = 
E 4 -> E defined by 



l-o if (a_i, 01,02) = (0,1,0) 



"0 



otherwise. 



The corresponding cellular automaton C : E z — » E z is a nontrivial involution of E z . It is described 
in Figure fT4l 
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Figure 14: The cellular automaton defined by the Hedlund marker. 



Example 6.4 (Conway's Game of Life). Let G = Z 2 , E = {0,1}, AI = {-1,0, l} 2 C Z 2 and 
H : E M — > E given by 



M2/) = < 



1 if. 



51 y( 7 



or 5^ y(m) = 4 and y((0, 0)) = 1 



meM 

otherwise 



3.2) 



for all y <E E M . The corresponding cellular automaton C : E z — > E z describes the Game of 
Life due to Conway. One thinks of an element g of G = Z 2 as a "cell" and the set gM (we 
use multiplicative notation) as the set consisting of its eight neighboring cells, namely the North, 
North-East, East, South-East, South, South- West, West and North- West cells. We interpret state 
as corresponding to the absence of life while state 1 corresponds to the presence of life. We thus 
refer to cells in state as dead cells and to cells in state 1 as live cells. Finally, if a € E z is a 
configuration at time t, then C{a) represents the evolution of the configuration at time t + 1. Then 
the cellular automaton in (|6.2p evolves as follows. 
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• Birth: a cell that is dead at time t becomes alive at time t + 1 if and only if three of its 
neighbors are alive at time t. 

• Survival: a cell that is alive at time t will remain alive at time t + 1 if and only if it has 
exactly two or three live neighbors at time t. 

• Death by loneliness: a live cell that has at most one live neighbor at time t will be dead at 
time t+1. 

• Death by overcrowding: a cell that is alive at time t and has four or more live neighbors at 
time t, will be dead at time 

Figure [T5l illustrates all these cases. Note that C is not injective and it can be shown that C is not 
surjectivc cither. 
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Figure 15: The evolution of a cell in the Game of Life. The symbol * represents any symbol in {0, li- 



lt easily follows from the definition that every cellular automaton C : £ G — > £ G is G-equi variant, 
i.e., C(ga) = gC(a) for all g £ G and a € £ G , and is continuous with respect to the prodiscrctc 
topology on £ G . The Curtis-Hedlund Theorem ([13], [III Theorem 1.8.1]) shows that the converse 
is also true. 

It immediately follows from topological considerations and the Curtis-Hedlund Theorem that 
a bijective cellular automaton C : S G —5- S G is invertible, in the sense that the inverse map 
C _1 : S G — > S G is also a cellular automaton. 

A map C : S G — s- S G is called pre-injective (a terminology due to Gromov [4l|) if whenever two 
configurations a, f3 € S G differ at only finitely many points (that is, the set {g € G : a(g) ^ P(g)} 
is finite) and C(a) = C(/3), then a = j3. Clearly pre-injectivity is a weaker form of injectivity. 

Moore and Myhill proved that for G = Z d , d > 1, a cellular automaton C: S G — > S G is 
surjective if and only if it is pre-injective. Necessity is due to Moore and sufficiency is due to 
Myhill. This result is often called the Garden of Eden Theorem. Regarding a cellular automaton 
as a dynamical system with discrete time, a configuration which is not in the image of the cellular 
automaton can only appear as an initial configuration, that is, at time t = 0. This motivates 
the biblical terminology. In 1993 Machi and Mignosi [64| extended the Garden of Eden theorem 
to finitely generated groups of subexponenti al g rowth (cf. the end of Section I3.2[) and, finally, 
Ceccherini-Silberstein, Machi and Scarabotti jl7| (see also Gromov [4Cjj ]) further extended it to all 
amenable groups. 

Recall that a group G is said to be amenable, a notion going back to von Neumann [85| , if there 
exists a left-invariant finitely additive probability measure on G, that is, a map m: V(G) —± [0, 1] 
such that m(G) = 1, m(A U B) = m(A) + m(B) - m(A n B) and m(gA) = m(A), for all 
A,B 6 V(G) and g € G. Finite groups, abelian groups, and more generally solvable groups, 
groups of subexponential growth are amenable groups. On the other hand the free nonabelian 
groups are non-amenable. 

Based on examples due to Muller [77j , in [ItJ it is shown that if the group G contains a free 
nonabelian group (and is therefore non-amenable, since the class of amenable groups is closed under 
the operation of taking subgroups), then there exist examples of pre-injective (resp. surjectivc) 
cellular automata on G which are not surjective (resp. not pre-injective). Finally, Bartholdi in 
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2010 [4j (see also Theorem 5.12.1 in [16|) proved the converse to the amenable version of Moore's 
theorem in [l7| . namely that if every surjective cellular automaton C: E G — > E G is pre-injective, 
then the group G is amenable. This yields a new characterization of amenability in terms of 
cellular automata. 

Following Gottschalk [36j , we say that a group G is surjunctive provided that for every finite 
set E every injective cellular automaton C: E G — > E G is surjective (and therefore bijective). It 
is an open problem to determine whether all groups are surjunctive or not. Lawton (6l| (see 
also [la . Theorem 3.3.1]) showed that all residually finite groups (in particular, all virtually free 
groups) are surjunctive. Recall that a group is residually finite provided that the intersection of 
all its finite index subgroups reduces to the trivial group (see, e.g. [ljl Chapter 2]. It immediately 
follows from the Garden of Eden Theorem for amenable groups that all amenable groups are 
surjunctive. Gromov [4l| and Weiss [9!| (see also [l6|, Theorem 7.8.1]) showed that all sofic 
groups are surjunctive. For the definition of soficity we refer to [l6|, Chapter 7]. We only mention 
that the class of sofic groups contains all residually finite groups and all amenable groups, and 
that it is not known if there are any non-sofic groups. 

One is often interested in determining whether a cellular automaton is injective or surjective. 
In particular, the following question naturally arises: is it decidable, given a finite subset M C G 
and a map p: E A/ — > E, if the associated cellular automaton C: E G — > E G defined in (|6.ip is 
surjective or not? Amoroso and Patt [l[ proved in 1972 that if G = Z the above Surjectivity 
Problem is decidable. On the other hand, Kari [52, 53, 54 1 proved that the similar problem for 



cellular automata with finite alphabet over Z , d > 2, is undecidable. His proof is based on Berger's 
undccidability result for the Domino Problem (see Section I5~2")) . It follows from the decidability of 
the monadic second-order theory of Cayley graphs of context-free groups (cf. Theorem 15. 6| that 
the Surjectivity Problem for cellular automata defined over finitely generated virtually-free groups 
is decidable. 

Indeed, that the cellular automaton is surjective is expressed by saying that for every disjoint 
cover (Ci, C2, ■ ■ ■ , C n ) of G (where Ci represents the points currently in state G E) there is a 
disjoint cover (Pi, P 2 , . . . , P n ) (the assignment of predecessor states) such that for every vertex v, 
one has v G Ci if and only if the points in the neighborhood of v are in the correct P-sets for the 
local defining map p to assign state to w. This fact is easily expressible as a monadic second- 
order sentence. It similarly follows that the Injectivity and Bijectivity Problems are decidable for 
cellular automata on finitely generated virtually-free groups. 

The following natural question is open. 

Question. Are there any finitely generated groups which are not virtually free but for which the 
Surjectivity, Injectivity or Bijectivity Problems are decidable? 



7. Finite automata on infinite inputs and infinite games of perfect information 

7.1. Biichi acceptance and regular languages in E N 

As mentioned in the Introduction, monadic sentences are too complicated to deal with directly. 
The theorems of Biichi (cf. Theorem l5.3p and of Rabin (cf. Theorem 1 5. 4[) are proved by developing 
a theory of finite automata working on infinite words and infinite trees respectively. Let w = 
wqWi . . . WiWi+i • • • G E N be an infinite word. (All our infinite words are infinite to the right.) In 
Buchi's original paper, a nondeterministic finite automaton working on a word w G E N is a tuple 
A = (Q, S, qo, 6, F) exactly as in the case of automata on finite words (cf. Section 12^4]) . Thus, as 
usual, Q is a finite set of states, E is a finite alphabet, go G Q is the initial state, 8 : Q x E — > V(Q) 
is the transition function and F C Q is a set of final states. A run of A on w is a map p : N —¥ Q 
such that p(Q) = qo and p(i + 1) G 6(p(i),Wi) for all i G N. We must now define when the 
automaton A accepts w G E N , which we write as A h w. The definition of Biichi acceptance is 
that A h w if there exists a run p of A on w such that some state from F occurs infinitely often. 
As in the case of finite words, we call the set 

L{A) = {w G E N : A h w} C E N 
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the language accepted by A. A subset L C £ N is a regular language if it is the language accepted 
by some finite automaton. 

Example 7.1. Let £ = {a, b}. We describe a finite automaton which accepts those infinite words 
w £ £ N containing b only a finite number of times. Let A = (Q, £, qoi -P) be a finite automaton 
where Q = {q b , q a , q c , q r }, <?o = qb, F = {q c } and 

S(q b ,a) = q a , 6{q b ,b) = q b , 
6(q a ,a) = {q a , q c }, 5(q a , b) = q b , 
S(q c ,a) = q c , S(q c ,b) = q r , 
S(q r ,a) = 6(q r ,b) = q r . 

The automaton is illustrated in Figure [TBI and it works in the following way. When in state q b , 
the automaton goes to q a on reading a and remains in q b on reading b. On reading a b in the 
state q a it goes to state q b . On reading an a in q a the automaton can either remain in state q a or 
"guess" that it will see no 6's in the future by going to the "check" state q c . In q c the automaton 
remains in q c as long as it sees only a's but goes to the reject state q r if it ever reads a b. Once in 
q r the automaton always remains in q r on either input. Since F = {q c }, in any accepting run the 
automaton must have guessed at some time that no more &'s occur and must then always remain 
in q c , thus seeing no more b's. And for any w € £ N containing only finitely many &'s there is an 
accepting run. 




a 

a 




a, b 



Figure 16: The automaton accepting the infinite words w S {a, 6} N containing only a finite number of b's. 

The overall goal is to associate with each monadic sentence 4> of SI S a finite automaton A^ 
such that (j) is true if and only if L{A$) ^ 0. In order to do this we need to establish the closure 
of regular languages under the three operations of union, complementation, and projection. These 
operations correspond to the logical connectives V, -i, and 3 respectively. If £ and E are alphabets 
and 7r : E — > £ is a map then tt induces a function 7? : E N — > E by letter-by-letter substitution. 
If L C E N is a language, then tt(L) C E is the projection of L under tt and we need to know that 
if L is a regular language over E then tt (L) is a regular language over E. 

The closure of regular languages with respect to the operation of union is easy to establish in 
essentially any model of finite automata. Also, projection is "easy" for nondeterministic automata, 
even on infinite words, and "hard" for deterministic automata. Suppose that tt : E — > E is a 
function inducing the projection tt : E" — > E and that A = (Q, E, 5, qo, F) is a nondeterministic 
automaton with alphabet E. To accept the projection of the language accepted by A, we define a 
nondeterministic automaton A which, on reading a letter a £ £ can make any transition that A 
can make on any preimage of a. Formally, 

A= (V(Q), %5,{q }, V{F)) where S(S, a) = \J \J S(q,a). 
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Note that even if we started with a deterministic automaton A, the automaton A is nondeter- 
ministic. 

7.2. Muller acceptance 

In general, the closure of regular languages with respect to complementation is "hard" for 
nondctcrministic automata, and regular languages in E N recognized by using Biichi acceptance 
generally require using a nondctcrministic automaton. The power of automata on infinite inputs 
is very sensitive to the acceptance condition used. Muller [76| introduced the concept of Muller 
acceptance, which is the most general type of acceptance commonly used. 

Definition 7.2. A nondetcrministic Muller automaton is a tuple A = {Q, £, 5, qo, J 7 ) where Q, £, 5 
and go are exactly as for a nondeterministic finite automaton but T C V(Q). Let w 6 S N be a 
word. If p is a run of A on w then we denote by Inf(p) the set of states occurring infinitely often 
in p. Then A accepts w if there exists a run p of A on w such that Inf (p) G T . 

Remark 7.3. If we compare Biichi acceptance with Muller acceptance, we have that the set of final 
states F C S is now replaced by the "accepting" family JF . Moreover w G S N is Biichi-accepted if 
Inf(p) (~l F ^ 0, while it is Muller- accepted if Inf(p) G T. 

The following result was conjectured by Muller and then proved by McNaughton [72| . 

Theorem 7.4 (McNaughton). For any nondeterministic automaton on infinite words using Muller 
acceptance, there is an equivalent deterministic automaton using Muller acceptance. 

While the negation of a Biichi acceptance condition is not a Biichi condition, the negation of a 
Muller acceptance condition J- is again a condition of the same type, namely the Muller condition 
defined by the accepting family V(ff) \ T. For a deterministic automaton A = (Q, S, S, qo, J 7 ) 
using Muller acceptance to accept the language L(A) we have 

Y,*\L{A) = L(-~A) where ->A = (<?, E, 5, q , V(Q) \ F). 

In short, -<A is obtained from A by simply complementing the accepting family. 

McNaughton's theorem thus proves that the class of regular languages of infinite words is 
closed under complementation. Proving McNaughton's theorem from scratch is not easy and it is 
an accident that determinizing the nondeterministic automaton of Example 17.11 is easy. 

Example 7.5. Let £ = {a, b}. We now present a deterministic finite automaton A using 
Muller acceptance which accepts exactly those words w G S N containing b infinitely often. Let 
A = (Q, S, S, qo, J-) be the finite automaton in which Q = {q a ,qf,}, £ = {a, b}, qo = 9a, 
J 7 = {{lb}, {q a ,qb}} and 

6(q a ,a) = q a , S(q a ,b) = q b , 
5{q b ,a) = q a , S(q b ,b) = q b . 

The automaton is illustrated in Figure [T7] and it works in the following way. The states q a 
and q b record which letter has just been read. On a word w G S N containing b infinitely often the 
set of states occurring infinitely often must be exactly {q a ,qb} in the case that both letters occur 
infinitely often or {q b } in the case that only b occurs infinitely often. Since T consists of these two 
sets, the automaton accepts exactly the desired words. Note that -^A — (Q, E, 5, qo, {{q a }}) is a 
deterministic automaton using Muller acceptance which accepts exactly those words containing b 
only finitely many times (cf. Example 17. ip . 

Deciding the Emptiness Problem for non-deterministic Muller automata is easy. Given A with 
underlying graph T, the language L(A) =/= if and only if there is a path in T from the initial 
state to a cycle containing exactly the states in some set S G F . 
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? = {{<ib},{q a ,qb}} 



Figure 17: The automaton accepting the infinite words w g {a, 6} N containing an infinite number of b's. 
7.3. Rabin's theory 

We now turn to considering automata on the infinite binary tree T%. Recall that each vertex 
of T2 is described by a finite word over the set {0, 1} of the two possible directions. For a 
nondctcrministic automaton with alphabet E working on T2, a possible input a consists of an 
element a G E T2 which can be described as a copy of T2 with all vertices labelled from E. In 
Rabin's model, a nondctcrministic automaton is a 5-tuplc A = (Q, E, 5, <?o, F), where Q, E, go an d 
T are defined as in Section [7^1 The transition function is of the form (5 : Q x E — > P(Q x Q). 
The automaton starts at the root e in the initial state go- A copy of the automaton at a vertex v 
always sends one copy to the left successor of v and one copy to the right successor of v. 

Example 7.6. If one has 

<5(<7o,a) = {(91,^3), («2, ?o)}) 

then when the automaton is in state go reading the letter a, it can send one copy to the left in 
state qi and one copy to the right in state q^, or it can send one copy to the left in state q^ and 
one copy to the right in state go- Note that both "and" and "or" occur in the description of the 
transition function. This situation is illustrated in Figure 1181 




Figure 18: An instance of the transition function in a Rabin automaton. The drawing convention is that the broken 
line visualizes the copy to the left, while the continuous line visualizes the copy to the right. 

We must now define what it means for an automaton A to accept an input a, for which we 
write ^Ihctas usual. An infinite path 7r through Ti is a path starting at the origin e such that each 
vertex in ir has exactly one successor in it. Note that it G {0, 1} N and there are thus uncountably 
many distinct infinite paths through the tree. A run p of A on a is an element in Q T2 , that is, a 
labelling of T2 by states from Q such that for each vertex v G T2 we have 

(p(vQ),p(vl))ed(p(v),a(v)). 

We will again use Muller acceptance although Rabin used a different but equivalent condition. So 
we specify a family J- C V{Q). Given a run p and a path 7r, we define Inf(p, tt) to be the set of 
states in p which occur infinitely often along the path ir. Finally, 

Aha if 3pV7r[Inf(/9,7r) G T]. 

In short, for every path 7r the set of states occurring infinitely often along tt must be some set S 
in the accepting family T . Note that S can vary with different paths. 
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Example 7.7. We extend Example 17.11 Suppose again that S = {a, b} and we now want an 
automaton which accepts a G £ T2 exactly if a contains some infinite path it on which b occurs 
only finitely often. Let A = (Q, £, S, q , T) where Q = {q a , qb, q d }, qo = q a , ? '= {{qa}, and 

$(q a ,a) = {(q d ,q a ), (q a ,q d )},S(q a ,b) = {{q d ,qb), (qb,Qd)}, 
S(qb,a) = {(q d ,q a ), (q a ,q d )},S(q b ,b) = {(q d ,q b ), (qb,q d )}, 
S(q d ,a) = S(q d ,b) = {(q d ,q d )}. 

The automaton is illustrated in Figure [19] and works in the following way. Its overall strategy 
is to make a nondeterministic choice of the path n. On reading an a in state q a , the automaton 
sends a copy in the "don't care" state q d in one direction and a copy in q a in the other direction. 
On reading a b in state q a , the automaton sends a copy in the "don't care" state q d in one direction 
and a copy in q the other direction. The state qb functions similarly. If the automaton is in the 
"don't care" state q d , it is not on the chosen path and so sends copies in q d in both directions on 
reading cither letter. It is easy to see that A h a if and only if a does contain an infinite path 
with only finitely many 6's. 




Figure 19: The Rabin automaton defined in Example 17. 71 



7.4- Infinite games of perfect information 

Deterministic automata on trees are not very powerful and nondcterminism is essential. Rabin's 
proof of the closure of regular languages under complementation was very difficult. We now know 
that the best way to understand automata on infinite inputs is in terms of infinite games of perfect 



information, as introduced by Gale and Stewart [34 



Let £ be a finite alphabet, let £ denote the set of all infinite words over £, and let W be a 
subset of £ N . We consider the following game between Player I and Player II. Player I chooses a 
letter o\ G £ and Player II then chooses a letter 02 G £. Continuing indefinitely, at step n Player 
I chooses a letter a-in-i G £ and Player II then chooses a letter o<i n G £. The sequence of choices 
defines an infinite word w = crio^ • • • o~ n ■ ■ ■ G £ N . Player I wins the game if w G W and Player II 
wins otherwise. The basic question about such games is whether or not one of the players has a 
winning strategy that is, a function <fi : £* — > £ such that when a finite word u has already been 
played, the player then plays (f>(u) G £ and always wins. Using the Axiom of Choice, it is possible 
to construct winning sets such that neither player has a winning strategy, but this cannot happen 
if the set W is not "too complicated" . 

Example 7.8. We show that if the set W is countable and |£| > 2 then the second player has 
a winning strategy by applying Cantor's diagonal argument. Let Wi = u>i,iu>i,2 • ■ ■ Wi,n • • • be the 
i-th word in W. On his turn, play 2k, Player II simply plays a letter different from W2k,2k- Thus 
the word resulting from the set of plays is not in W. Note that this simple example shows that 
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strategies need not at all be effectively computable. Since the W{ are infinite words, even a single 
such word need not be computable since VV is an arbitrary countable subset of S N . 

The set E N becomes a complete metric space by defining dist(v, w) = 2~-? for all v = v\v 2 • • • 
and w = W1W2 ■ • • ; where 7 is the least index such that Wj ^ Vj . An important theorem of 
Martin 0, El (see also [Hi Sect. 20] and 0, Sect. 6F]) shows that if the set W is a Borel set 
then one of the two players must have a winning strategy. In applying infinite games to automata, 
one needs only consider winning conditions which are Fg i<r and that such games are determined 
was proven by Davis pjl ] before Martin's general result. Given an automaton A and an input a, 
one defines the acceptance game Q(A 1 t) for A on the input a. The first player wins if A accepts 
a while the second player wins if A rejects. 

Muller and Schupp [82j defined alternating tree automata as a generalization of nondetermin- 
istic automata working on trees. In this model, the transition function has the form <5 : Q x £ — > 
C(Q x {0, 1}, where C(Q x {0, 1} is the free distributive lattice generated by all possible pairs 
(state, direction). 

Example 7.9. We consider again a nondeterministic automaton in which 

%o,a) = {(qi,q-3), (92, go)} 

as in Example 17.61 In the lattice notation we can write this as 

5(q ,a) = [( 9l ,0) A (q 3 , 1)] V [(g 2 ,0) A (g ,l)]. 

Here the symbol V stands for nondeterministic choice and A means "do both things" . 

We dualize a transition function of an alternating tree automaton by interchanging A and V 
as usual. For the example above we have: 

5(q ,a) = 0) V (q 3 , 1)] A [(g 2 ,0) V (q ,l)}. 

Converting this expression to disjunctive normal form we have: 

S(q ,a) = [( qi ,0) A faO)] V [(g 1; 0) A {q , 1)] V [(q 3 , 1) A (q 2 ,0)} V [(q 3 , 1) A (q ,l)]. 

We interpret this as saying that when the automaton is in state go reading the letter a it has a 
choice of sending one copy to the left in q\ and another copy to the left in or sending a copy 
to the left in q\ and a copy to the right in go, or a copy to the right in q 3 and a copy to the 
left in 52, or, finally, a copy to the right in q 3 and another copy to the right in qg. This is not 
a nondeterministic automaton but it is a perfectly good alternating automaton. Note that the 
automaton can send multiple copies in the same direction and is not required to send copies in all 
directions. It must, of course, send at least one copy in some direction. 

We now have a framework general enough to always be able to dualize. 

Definition 7.10. Let A ~ (Q, E, 5, qo, J-) be an alternating automaton on the rooted infinite 
binary tree. Then the dual automaton of A is 

A = (<5,S,5,go,-^) 

where S is obtained by dualizing the transition function 5, and the accepting family is J- = V(Q)\J-. 

It is clear from the definition that the dual of A is just A. One must carefully define the 
acceptance game Q(A,t) of A on an input a (for details see [82j]). That this game is determined 
follows from Davis' theorem. In the alternating framework, it is easy to check that a winning 
strategy for the second player in Q (A, t) is a winning strategy for the first player in the acceptance 
game G(A, t) for the dual automaton. Thus complementation is easy for alternating automata 
and the following theorem is a consequence of pure dcterminacy. 
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Theorem 7.11 (The Complementation Theorem) . If A is an alternating tree automaton accepting 
the language L(A) then the dual automaton A accepts the complementary language ->L(A). 

Of course, something must be hard for alternating automata and it is the operation of pro- 
jection. The argument for nondcterministic automaton fails completely because there may be 
multiple copies of the automaton at the same vertex of the tree. So we must prove that given 
an alternating automaton, there is a nondcterministic automaton accepting the same language. 



Gurevich and Harrington [42J made a fundamental contribution to understanding automata on 
infinite inputs by showing that a winning strategy in the acceptance game for a nondetermin- 
istic automaton depends only on a finite amount of memor y ca lled the later appearance record. 



This is called the Forgetful Determinacy Theorem (see 4^, 103j )- Muller and Schupp [82[ used 
the later appearance record to prove the Simulation Theorem which states that there is an effec- 
tive construction which, given an alternating automaton, produces a nondcterministic automaton 
accepting the same language. 

Given the Complementation and Simulation theorems, most results have short conceptual 
proofs. As an illustration, we present a proof of McNaughton's theorem. 

Proof of Theorem |7.^[ There is a natural notion of an automaton which is alternating but still 
deterministic. Namely, one with no V's in its transition function. The Simulation Theorem 
shows that if we start with a deterministic alternating automaton, then the simulating ordinary 
automaton is a deterministic automaton. If A is a nondcterministic automaton on the line (i.e. 
\D\ = 1), using Muller acceptance, then A has only V's in its transition function. Then its dual 
automaton A has only A's in its transition function and therefore is a deterministic alternating 
automaton. By the Simulation Theorem we can construct a deterministic automaton A 1 on the 
line which accepts the same language L' as A. By the Complementation Theorem, L' is the 
complement of the language L accepted by A. Since A' is deterministic we obtain a deterministic 
automaton ->A' accepting the complement of L' , that is, L, by simply complementing the accepting 
family of A, thus establishing McNaughton's Theorem. □ 
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